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Reopening schools during COVID-19 


oronavirus disease 2019 (COVID-19) is upending 
education. Operating schools during the pan- 
demic entails balancing health risks against the 
consequences of disrupting in-person learning. 
In the United States, plans differ among states as 
schools have already reopened or plan to reopen. 
Scientific understanding of severe acute respira- 
tory syndrome coronavirus 2 (SARS-CoV-2, the cause of 
COVID-19) should inform how schools reopen. 

Although school children and adolescents (ages 3 to 
18 years) can develop COVID-19, most remain asymp- 
tomatic or experience mild illness. These youngsters 
may be less susceptible to infection 
than older individuals but probably 
spread the infection at similar rates. 
SARS-CoV-2 infections in children and 
adolescents are rising faster than in 
other age groups as restrictions have 
been eased. Infections have been im- 
ported into schools from the commu- 
nity. But further transmission within 
schools has been rare when rigorous 
measures have been implemented to 
reduce the risk of person-to-person 
spread. Larger school outbreaks are 
associated with increased commu- 
nity transmission, insufficient physi- 
cal distancing, poor ventilation, and 
lack of masking. Schools that imple- 
mented transmission mitigation mea- 
sures (including in European countries) seem not to 
have substantially contributed to increased circulation 
of the virus among local communities. 

What can schools do? The evidence thus far points to 
three mitigation strategies for reopening. 

Minimizing the import of infections into the school can 
stem the spread of COVID-19. Daily symptom screening 
can identify individuals with COVID-19 at first presenta- 
tion. They should seek diagnostic testing. However, infec- 
tions can be silent. Approximately 15 to 50% of children 
and 10 to 30% of adults will either not notice symptoms 
while their immune system fights the infection (asymp- 
tomatic carriers) or become infectious 1 to 3 days before 
symptom onset (presymptomatic carriers). Current di- 
agnostic tests cannot identify silent infections reliably 
and are not sufficiently fast and inexpensive to make a 
school-wide testing-based surveillance system practical. 
Thus, the most effective tool for minimizing the risk of 
infections being carried into schools is to restrict in-per- 
son learning to when infection in the local community 
is controlled. Countries with widespread testing began 


“The evidence 
thus far points 
to three 


mitigation 
strategies for 
reopening. 


opening schools with rigorous safety measures in place 
when fewer than 30 to 50 new infections were observed 
within 7 days per 100,000 residents over a prolonged pe- 
riod. Countries providing in-person schooling with basic 
mitigation measures (i.e., distancing, face masks worn in 
hallways but not classrooms, hand hygiene, ventilation, 
and staying home with minimal symptoms) typically 
have close to zero community transmission. 

The likelihood of further transmission must be mini- 
mized if infections are brought into school. COVID-19 is 
spread through liquid particles containing the virus that 
are generated by breathing, speaking, shouting, singing, 
coughing, and sneezing. The rapid 
settling rate of large droplets under- 
lies recommendations for physical 
distancing, surface disinfection, ven- 
tilation, and hand hygiene. Because 
smaller liquid particles dispersed as 
aerosols stay airborne, it is not only 
the distance from another person that 
determines the risk of transmission, 
but also the duration of exposure. 
Limiting room occupancy, avoiding 
activities such as singing, and improv- 
ing ventilation are critical in transmis- 
sion control. Masks reduce spread by 
droplets and aerosols by limiting re- 
lease and inhalation. Airborne spread 
is much less likely outdoors, but 
sports, where proximity to excessive 
exhalation is intrinsic to the game, need to be avoided. 

Large outbreaks in school can be minimized by 
limiting secondary transmission to the smallest pos- 
sible number of persons. Cohorts that remain relatively 
isolated from each other can reduce person-to-person 
contact and can facilitate contact tracing if outbreaks 
occur. Early detection of infected individuals through 
symptom surveillance and diagnostic testing can limit 
quarantine measures to the affected cohorts, rather 
than having to close grades or the entire school. 

From these three efforts, a layered approach to risk 
mitigation in schools can be developed where measures 
with partial effectiveness are combined to reduce the 
probability of children, teachers, staff, and family mem- 
bers becoming ill with COVID-19.* 

The lower the infection rate in the community, the less 
stringent other risk mitigation measures need to be. 
If communities prioritize suppressing viral spread in 
other social gatherings, then children can go to school. 


—-Ronan Lordan, Garret A. FitzGerald, Tilo Grosser 


*The figure is available in the supplementary materials 
(science.sciencemag.org/content/369/6508/1146/suppl/DCl). 
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EDITORIAL 


Enshrining equity in democracy 


n 18 August 2020, the United States celebrated 
the 100th anniversary of the 19th amendment 
to its Constitution, which granted the right to 
vote to female U.S. citizens. This amendment 
had a profound, yet uneven, impact on the 
lives of female scientists and on the scientific 
enterprise at the time and into the 21st cen- 
tury, enabling white women in science to gain greater 
professional acceptance, to expand their opportunities 
for scientific work, and to fight for equal pay. At the 
same time, women of color did not receive the right to 
vote until 1965. The participation of women of color 
in scientific professions was thus se- 
verely limited during the interven- 
ing years; a disparity that continues 
today, and which may worsen as 
women throughout the country are 
being tasked with new and more 
extensive childcare, elder care, and 
household responsibilities as a re- 
sult of the coronavirus disease 2019 
(COVID-19) pandemic. 

As the distinguished historian 
Margaret Rossiter noted in the first 
volume of her path-breaking tril- 
ogy Women Scientists in America: 
Struggles and Strategies to 1940, in 
the period from 1880 to 1919, white 
women in the United States had 
begun to earn doctorates in scien- 
tific fields in greater numbers and to increase their 
presence in many leading scientific organizations. 
However, earning more doctorates did not necessarily 
lead to more desirable jobs, nor to an increase in the 
number of major publications. And even the most out- 
standing white female scientists frequently held lowly 
titles in universities and laboratories, if they held 
positions in these spaces at all. Some were relegated 
to women’s colleges, departments of home econom- 
ics, and separate women’s scientific clubs. Often, they 
were only recognized for their contributions to science 
decades after their achievements. 

Many of these women joined in the suffrage move- 
ment, with the idea that the vote would help to advance 
their progress in scientific fields, but they often failed to 
confront their own exclusionary practices, particularly 
those surrounding race. In not advocating for voting 
rights for all women, they helped to support the seg- 
regation of scientists of color within scientific institu- 
tions, especially female scientists of color. Indeed, little 
was done by leading scientists to address issues of race 
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to bridge gaps 
in child and 


elder care still 
falls mainly 
on women.” 


or the representation of women of color in science until 
after World War II. Even after decades of efforts to in- 
crease the diversity of the U.S. scientific workforce, we 
are still struggling with this legacy of exclusion today. 

It is apt that we reflect on the historical struggles 
of women and the disproportionate burdens borne by 
women of color now, at a time when many female scien- 
tists find themselves once more disadvantaged profes- 
sionally, as they assume greater familial responsibilities 
during the COVID-19 pandemic. Universities and other 
scientific institutions have never met the capacity to 
support the needs of all families, and the burden to 
bridge gaps in child and elder care 
still falls mainly on women. Most 
routinely piece together support 
by combining paid care services 
and help from family members, or 
compete for limited access to, and 
financial support from, institu- 
tional benefits. The pandemic has 
complicated this already difficult 
process and introduced new house- 
hold stresses. Online home-based 
education, for example, is poised to 
remain part of the education system 
from K-12 through college for the 
foreseeable future. These burdens 
cross lines of race, ethnicity, age, and 
class, but are likely to disproportion- 
ately affect women from groups that 
have been historically disenfranchised in science, tech- 
nology, engineering, and mathematics (STEM) fields, 
including Black and Latinx women, who have a long 
history of shouldering more family responsibilities than 
their white counterparts. 

Prioritizing the creation of a national, federally sup- 
ported, robust system for family care would represent a 
long-needed step toward justice and equity for women 
in science. Other developed countries have various pro- 
grams and policies in place, but much more research 
and more proposals for how to implement and support 
such programs are needed. If scientific institutions do 
not begin to address the issue of family support, the 
nation runs the risk of losing an entire generation of 
talented female scientists. We do not need a report writ- 
ten years from now bemoaning this loss. If we wish to 
create a more equitable future for all scientists, then 
now is the time to redress this long-neglected issue that 
hinders the full participation of women in STEM. 


—Evelynn M. Hammonds 
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Hurricane Laura’s double punch 


DISASTERS | When it roared ashore 

last week in Louisiana, Hurricane Laura 
packed a double whammy, endangering 
public safety with its wind and water and 
slowing efforts to stem the COVID-19 pan- 
demic. Its top wind speed at landfall, 

241 kilometers per hour, was the fifth 
highest documented for any U.S. hur- 
ricane. Laura tied a record for the fastest 
intensifying storm in the Gulf of Mexico, 
with its wind increasing on 26 August by 
105 kilometers per hour in just 24 hours; 
the causes of such rapid strengthening are 
little understood. The storm led to at least 
19 deaths in Louisiana and Texas. It also 
threatened to accelerate the spread of 
COVID-19; testing centers were tempo- 
rarily closed, and residents of southwest 
Louisiana, which bore the storm’s brunt 
and had been recording some of the 
state’s highest rates of positive test 
results, evacuated elsewhere. Seven hur- 
ricanes and tropical storms have hit 

the United States so far this year, one 

of the most active seasons on record. 
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Dutch mink farms ordered shut 


AGRICULTURE | The Dutch government 

last week decided to end mink farming to 
prevent the animals from becoming sources 
of the virus that causes COVID-19. More than 
40 mink farms in the Netherlands—almost 
one in three—have had outbreaks of the 
virus since late April, triggering massive 
culls. A Dutch law adopted in 2012 banned 
mink farming by 2024 for ethical reasons, 
but now the remaining farms must close by 
March 2021. The government has set aside 
€182 million to indemnify farmers. Although 
farms implemented hygiene rules, scientists 
suspect infected people carried the virus into 
them. Denmark, Spain, and the United States 
have seen outbreaks at mink farms as well. 


Poop test halts college cluster 


PUBLIC HEALTH | By testing dormitory 
wastewater for SARS-CoV-2, the virus that 
causes COVID-19, the University of Arizona 
may have stamped out a potential outbreak 
before it could spread. Several countries, 
US. municipalities, and some universities 
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Hurricane Laura’s damage came mostly 
from wind, but floodwaters surrounded a 
house in Little Chenier, Louisiana. 


have been checking sewage for RNA from 
the virus, which can signal infections shortly 
before clinical cases and deaths are recorded. 
In Arizona, officials announced last week 
that wastewater from a student dormitory 
contained the viral RNA just days after stu- 
dents had moved into their rooms in August; 
all 311 residents and dorm workers had 
previously tested negative on a mandatory 
test for COVID-19. The university retested all 
of them and found two students who were 
asymptomatic but positive for the virus; they 
were then quarantined. 


A boost for rapid COVID-19 testing 


DIAGNOSTICS | The U.S. Food and 

Drug Administration last week issued an 
emergency use authorization to Abbott, a lab- 
oratory company, for a 15-minute test for the 
COVID-19 virus that could help expand the 
number of Americans regularly tested. The 
new diagnostic, called BinaxNOW, detects 
proteins, or antigens, that are unique to the 
virus with high accuracy and at a cost of only 
$5 each. Other coronavirus tests that identify 
genetic material unique to the virus typically 
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cost $100, and laboratories often take days to 
provide results. The genetic tests and other, 
antigen-based ones require specialized lab 
equipment; Abbott’s does not, although a 
health care professional must administer it. 
The company says it plans to produce 50 mil- 
lion tests in October. Last week, the Trump 
administration announced it would buy 150 
million. The United States currently conducts 
about 700,000 tests for the virus per day. 


CDC relaxes testing guidelines 


PoLicy | The US. Centers for Disease 
Control and Prevention (CDC) drew criti- 
cism last week for revising its guidelines to 
state that people exposed to the virus that 
causes COVID-19 “do not necessarily need a 
test” if they lack symptoms and do not have 
medical conditions that make them vulner- 
able. Scientists and public health specialists 
slammed the 24 August revision, noting 
that people who do not feel sick can still 
spread the virus and that the United States 
continues to lead the world in COVID-19 
cases and deaths. Trump administration 
officials have said too many people have been 
getting tested out of fear and tests should 
be reserved for those at highest risk, The 
New York Times reported. But CDC Director 
Robert Redfield appeared to muddy the mes- 
sage when he said on 26 August that testing 
“may be considered for all close contacts of 
confirmed or probable COVID-19 patients.” 


Virus hunters get new money 


INFECTIOUS DISEASES | The EcoHealth 
Alliance, a nonprofit whose highly scored 
grant to study bat coronaviruses that could 
jump to humans in China was summarily 
defunded after President Donald Trump 
targeted it, has received new funding worth 
$7.5 million over 5 years, the U.S. National 
Institutes of Health (NIH) announced last 
week. In April, Trump alleged without 
evidence that the COVID-19 virus escaped 
from the Wuhan Institute of Virology; the 
EcoHealth Alliance had collaborated with 
scientists there on the canceled grant. NIH 
ended it days later, drawing strong protests 
from scientists. The newly funded work will 
not revive the earlier project but instead will 
focus on risks of animal viruses jumping to 
humans in Southeast Asia, but not China. 
The EcoHealth Alliance is one of 11 groups 
NIH plans to fund with $82 million to study 
such risks. “It’s a relief for us to know that 
NIH isn’t going to black-ball our organization 
because of political interference,” EcoHealth 
Alliance President Peter Daszak says. 
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United States boosts Al, quantum research 


he United States will establish a dozen centers to study artificial in- 

telligence (AI) and quantum information science (QIS), the White 

House announced last week. The seven university-based AI cen- 

ters will receive $20 million each over 5 years from the National 

Science Foundation or the Department of Agriculture and will 

use AIl—algorithms that can learn to recognize patterns—to tackle 
problems in areas ranging from farming to particle physics. The five cen- 
ters on QIS, located at the Department of Energy’s national laboratories, 
will focus on topics such as developing quantum computers that could 
solve challenges that would overwhelm conventional computers. Each of 
these centers will receive $125 million over 5 years, as Congress called for 
in the 2018 National Quantum Initiative Act. 


China’s R&D budget keeps rising 


FUNDING | China continued in 2019 

its yearslong run of double-digit annual 
percentage increases in spending on 
R&D. But it has not yet reached its long- 
standing goal of increasing R&D expen- 
ditures to 2.5% of gross domestic product 
(GDP). Total public and private science 
and technology expenditures in 2019 rose 
12.5% to 2.21 trillion Chinese yuan ($322 
billion), the National Bureau of Statistics 
of China reported last week. Most (83%) 
went to development, while basic research 
received 6% and applied research 11%. 
Relative to other countries, China has 
been spending more on development 
and less on basic research. Its total R&D 
spending in 2019 amounted to 2.23% 

of GDP, still short of the United States’s 
2.83%. China was the world’s second big- 
gest spender on R&D behind the United 
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3.4 


million 
Square kilometers of sea floor 
changed by human activities, such 
as the construction of ports, 
communication cables, oil rigs, and wind 
farms, as of 2018, representing an 
estimated 1.5% of all coastal areas. 


WHY IT MATTERS 
The modified area equals that of 
cities on land, and its marine 
ecosystems may have sustained damage 
(Nature Sustainability). 


Published by AAAS 


States in 2018, the latest year for which a 
comparison is available, according to the 
Organisation for Economic Co-operation 
and Development. Analysts expect China 
to continue to close the gap. 


Intermediate black hole found 


ASTRONOMY | Gravitational wave hunt- 
ers have netted a big fish: the signal from 
a pair of black holes merging to produce 
one with a mass of about 142 Suns. 

That heft makes it the first confirmed 
intermediate-mass black hole, with a 
mass between those produced by col- 
lapsing stars and the giant black holes at 
the hearts of galaxies. Detected in May 
2019 by the twin Laser Interferometer 
Gravitational-wave Observatory facili- 
ties in the United States and the Virgo 
detector in Italy, the merger is also the 
most distant seen, at 7 billion light-years 
away, as well as the most powerful, with 
the mass of eight Suns converted into 
energy. The masses of the individual black 
holes—85 and 66 solar masses—before 
they merged pose a puzzle, as theorists 
believe it impossible to make a black hole 
heavier than 65 Suns from the collapse 
of a single star. The discovery is reported 
this week in Physical Review Letters and 
The Astrophysical Journal Letters. 


Iran allows nuclear inspections 


NONPROLIFERATION | After a 
monthslong impasse, Iran has agreed 

to allow international inspectors access 
to two sites that were allegedly part of 

a clandestine nuclear weapons program. 
The move preserves, for now, what 
remains of a multination nuclear deal 
reached in 2015, from which the Trump 
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administration has withdrawn. The inspec- 
tions will take place at Abadeh, a testing 
range for high explosives in central Iran, and 
at an undisclosed site, which intelligence 
reports revealed might have contained unde- 
clared nuclear materials and activities. Iran 
had rebuffed requests from the International 
Atomic Energy Agency to take samples at 
the sites; continued stonewalling could 

have prompted the agency to declare Iran 
out of compliance with its commitments. 
The United States maintains that Iran has 
violated the nuclear deal, and banking and 
other sanctions lifted after the 2015 accord 
must automatically resume. But members 

of the United Nations Security Council last 
week reiterated their disagreement with that 
interpretation, and the United States now 
plans to reimpose those sanctions unilater- 
ally on 20 September. 


ARCHAEOLOGY 


U.S. coal ash rules loosened 


ENVIRONMENT | The Trump administra- 
tion on 31 August eased rules on toxic 
wastewater created by coal-burning power 
plants, which operators discharge into 
rivers and streams. The move changes a 
rule adopted in 2015 by former President 
Barack Obama’s administration requiring 
plant operators to treat and recycle water 
used to store coal ash, which contains 
mercury and arsenic, by 2023. The Trump 
administration’s version instead exempts 
plants set to close or switch to natural 
gas by 2028 and allows other plants to 
delay compliance until that year if they 
voluntarily adopt advanced biological 
treatment. The administration says its 
rule will save the industry money and 
retain coal-industry jobs while reducing 


A study traced helminths including whipworms 
_ (Trichuris trichiura), which grow up to 

50 millimeters long and infect 

human intestines. 


Intestinal worms rampant in medieval Europe 


iny parasitic worms known as helminths cause malnutrition and developmental 
disorders in some 1.5 billion people around the world, mostly in developing 
countries. Scientists now report new evidence that better sanitation can allevi- 
ate this scourge. During the Middle Ages and for centuries after, worm infections 
were as prevalent among Europeans as they are today in people living in parts 
of sub-Saharan Africa, South America, and East Asia, the authors report in a paper pub- 
lished on 27 August in PLOS Neglected Tropical Disease. They based the conclusion 
on an analysis of 589 samples from skeletons in medieval cemeteries in the Czech 
Republic, Germany, and the United Kingdom. Because such worms were eradicated 
in Europe before effective antiparasite drugs were developed, the results reinforce 
the idea that improvements to water supplies, sanitation, and hygiene can dramati- 
cally reduce the disease burden they cause today. 
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total pollution by 1 million pounds annu- 
ally, over and above the 1.4 million pound 
reduction anticipated under the Obama 
rule. But environmental groups rejected 
those assertions and predicted that power 
plants—now the largest contributors of 
industrial water pollution—will discharge 
even more. The critics add that the move 
will prop up coal power, which is respon- 
sible for emitting a significant share of 
global warming gases. 


Black turbine paint saves birds 


CONSERVATION | A Norwegian wind farm 
has devised an inexpensive method that 
may prevent birds from being killed by tur- 
bines’ rotating blades. By painting only one 
turbine blade black, the farm reduced bird 
collisions by more than 70%, say research- 
ers who conducted the first field study 

of the approach. Fast-moving, monotone 
blades can be difficult for birds to see; in 
the United States alone, collisions with 
wind turbines kill 140,000 to 500,000 birds 
each year. But a single contrasting black 
blade makes this rotating obstacle easier 
for birds to identify and avoid, researchers 
report in the 27 August issue of Ecology 
and Evolution. The approach needs further 
validation, other researchers say. And they 
note that windmills still rank low on the 
list of threats to birds: Collisions with 
power wires and communication towers kill 
an estimated 32 million birds in the United 
States annually, for example, and cats are 
believed to kill 2.4 billion each year. Loss of 
habitat is another leading threat. 


Togo ends sleeping sickness 


INFECTIOUS DISEASES | Togo is the 

first African country to have eliminated 
Human African trypanosomiasis (HAT), 
better known as sleeping sickness, as a 
public health problem. The World Health 
Organization (WHO) on 25 August 
certified the country as free of HAT, 
which is caused by two subspecies of the 
Trypanosoma brucei parasite and spread 
by tsetse flies. Occurring only in sub- 
Saharan Africa, HAT causes neurological 
damage and is fatal when left untreated. 
Surveillance and control programs have 
helped bring reported cases down sharply, 
from more than 25,000 in 2000 to 980 last 
year. WHO hopes the subspecies 7: 0. 
gambiense, which occurs in West and 
Central Africa and is responsible for more 
than 98% of cases, can be eliminated alto- 
gether by 2030. “I am sure [Togo’s] efforts 
will inspire others,’ Matshidiso Moeti, 
WHO regional director for Africa, said 

in a statement. 
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COVID-19 


Can Europe tame the pandemic’s next wave? 


Countries seek new strategies as coronavirus Cases are rising again across the continent 


By Kai Kupferschmidt 


ere at risk of gambling away 

our success,” virologist Christian 

Drosten warned in the German 

newspaper Die Zeit earlier this 

month. His message referred to 

Germany, but it could have been 
addressed to all of Europe. After beating back 
COVID-19 in the spring, most of Europe is 
seeing a resurgence. Spain is reporting close 
to 10,000 cases a day, more than it had at the 
height of the outbreak in the spring. France 
is back to reporting thousands of cases a day. 
In Germany, numbers are still low, but ris- 
ing steadily. The pandemic is affecting coun- 
tries that saw few cases in the spring, such 
as Greece and Malta, but is also rebounding 
in places that suffered terribly, including the 
cities of Madrid and Barcelona. 

Drosten, of the Charité University Hos- 
pital in Berlin, is one of many calling for 
renewed vigilance, and he and others are 
urging a new control strategy that trades 
blanket lockdowns for measures specifi- 
cally targeting clusters of cases, which play 
a key role in spreading the coronavirus. 
“We successfully aborted the [first] wave 
and now we should make sure that no 
new wave builds,” epidemiologist Christian 
Althaus of the University of Bern says. 

Few dispute that Europe rose to the initial 
challenge. In Bergamo, the capital of Italy’s 
Lombardy region, crematoria were so over- 
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burdened in March that army trucks had to 
transport the dead to other cities—but on 
24 May, Lombardy registered zero COVID-19 
deaths for the first time. By early July, the 
European Union and the United Kingdom 
together averaged fewer than 5000 new 
cases per day, whereas the United States 


The coronavirus comeback 

The number of new COVID-19 cases soared this past 
month in France (not shown) and Spain. Germany 
and other European countries saw a slower increase. 
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and Brazil (which together have roughly the 
same population) had 50,000 and 40,000, re- 
spectively. Europeans enjoyed a surprisingly 
normal summer, with northern Europeans 
flocking to Mediterranean beaches. 

The rising case numbers today aren’t 
quite comparable to the peak in April be- 
cause countries are now testing far more 
people on a daily basis. But the increase 
shows that Europe relaxed measures too 
early and too much, says virologist Ab 
Osterhaus of the University of Veterinary 
Medicine in Hanover, Germany. “The wrong 
message was given, basically: We have done 
a great job and now we can relax again.” In- 
stead, Europe could have tried to emulate 
New Zealand by stopping community trans- 
mission completely and zealously guarding 
against reintroductions, says Devi Sridhar, 
a global health expert at the University of 
Edinburgh who has been advising the Scot- 
tish government. Scotland committed early 
on to pushing case numbers down to zero, 
but other countries did not, and now almost 
all are seeing a resurgence. 

People’s willingness to stay alert and 
remember new rules wanes quickly, says 
Cornelia Betsch, a psychologist at the Uni- 
versity of Erfurt who has been monitoring 
attitudes toward the pandemic in Germany. 
“And we have been going for a while now, 
and the end is not even clear.’ Some coun- 
tries saw workplace infections rise as people 
returned to their offices, says Gianfranco 
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Spiteri, a public health expert at the Euro- 
pean Centre for Disease Prevention and Con- 
trol. But in many countries the resurgence is 
driven by “young people partying and basi- 
cally people living their life back in a kind of 
normal way,’ he says. Because new cases are 
younger, fewer of them die, but “it’s a mat- 
ter of time before the elderly are affected,” 
Spiteri says. The reopening of schools across 
the continent may make matters worse. 

As in the spring, every country has its own 
strategies for controlling the pandemic, lead- 
ing to a sometimes confusing patchwork. 
Belgium has one of the strictest face mask 
policies, for instance, but Belgians crossing 
the Dutch border to shop in Maastricht can 
take off their masks. Even within countries, 
the rules can change at dizzying speed. Ger- 
many went from a mandatory 14-day quar- 
antine for people arriving from countries 
considered risky to voluntary tests at the 
airport and other entry points, with no quar- 
antine for those who tested negative. Next, it 
made the tests mandatory, then returned 
to mandatory quarantine with testing after 
5 days. “What would be necessary is that 
we define one central policy in Europe,” 
Osterhaus says. “The problem is, 


that person’s contacts—than the new case’s 
contacts; after all, the patient may not infect 
anybody else, but is likely to have caught the 
virus as part of a cluster, Drosten says. 

Adam Kucharski, a disease modeler at the 
London School of Hygiene & Tropical Medi- 
cine, agrees. “Looking backwards can actu- 
ally give you a disproportionate benefit in 
terms of identifying infections,” he says. In a 
recent preprint, Kucharski and his colleagues 
estimated that “backward contact tracing” 
could prevent twice as many infections as 
tracing contacts forward alone. Experience 
in South Korea, where clusters at churches 
drove the epidemic early on, confirmed the 
value of this approach, says University of 
Florida biostatistician Natalie Dean. 

Putting more effort into finding clusters 
should also help epidemiologists under- 
stand where and how they emerge, says 
Hitoshi Oshitani of Tohoku University in 
Japan—which may have changed since 
the spring. “We’ve seen a massive change 
in the social structure and interactions of 
populations ... from the start of the pan- 
demic,’ Kucharski says. The conditions 
that spread the virus then “won’t necessar- 

ily be the same ones that are creat- 


who is going to do that?” The Eu- - _—_mmm— ing the risk now.” In Germany, for 
ropean Union has little power to Science's instance, many large outbreaks 
coordinate health measures. COVID-19 early in the pandemic occurred 

Yet countries are better prepared reporting is in long-term care facilities. Now, 


this time. Whereas the virus spread 
largely under the radar in Febru- 


supported by the 
Pulitzer Center 


clusters are increasingly reported 
from workplaces. 


and the 
ary, widespread testing now reveals Heising-Simons More-targeted measures prob- 
its movements. (Fewer than 3% of Foundation. ably won’t be enough to keep the 


tests are positive in most European 
countries, a sign of a healthy testing capac- 
ity.) Face masks, not available or even rec- 
ommended in the beginning, have become 
ubiquitous in most countries. More than a 
dozen EU countries have developed apps 
to help contact tracing efforts. Better treat- 
ments are saving lives. 

Meanwhile, new insights into viral spread 
are leading to better targeted control mea- 
sures. The emphasis on hand hygiene is 
gone because it has become clear that con- 
taminated surfaces don’t play a large role. 
In the spring, some countries banned al- 
most any outdoor activity, including jog- 
ging; now, the focus is on indoor activities. 
“We've learned outdoor hospitality is gener- 
ally fine, nonessential shops are fine as long 
as people wear face coverings, public trans- 
port doesn’t seem that risky,” Sridhar says. 

Instead, public health experts increasingly 
argue for targeting clusters of cases and 
superspreading events. Some studies esti- 
mate that 10% of patients cause 80% of all in- 
fections, whereas most don’t infect anybody 
at all (Science, 22 May, p. 808). Drosten has 
urged that contact tracers spend more time 
finding the source of a new case—along with 
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virus from  resurging, Althaus 
says. “A point will be reached again where 
stricter measures have to be taken,” he says. 
But rather than complete lockdowns, he as- 
sumes they will be more like the lighter ver- 
sion applied in Sweden, which encouraged 
people to work from home and banned large 
gatherings while keeping shops and restau- 
rants open. Scotland recently closed pubs 
and restaurants in Aberdeen for more than 
2 weeks after a cluster of cases emerged; it 
asked inhabitants not to travel more than 
8 kilometers outside the city and visitors to 
stay away. But schools remained open. 
Compared with the United States, Eu- 
rope has one advantage as it faces its first 
pandemic winter: Control measures aren’t 
nearly as controversial. Protests against 
masks and social distancing broke out in 
many European cities in August, but they 
represented a small minority of the popula- 
tion, Betsch says. In Germany, support for 
control measures declined somewhat after 
infections peaked in spring, but a large 
majority still backs them, Betsch says. And 
with case numbers back on the rise, she 
says, “We can already see acceptance num- 
bers go up again.” 
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RESEARCH FUNDING 


Bill threatens 
key Brazilian 
universities 


Proposal to strip Sao Paulo 
institutions of reserve funds 
draws fierce opposition 


By Ignacio Amigo 


cademic researchers in Sao Paulo, 

Brazil’s wealthiest and most popu- 

lous state, are warning that a pro- 

posed budget bill could cripple 

major universities and long-term re- 

search projects. The state is home to 
three of Latin America’s most prestigious 
universities and produces 40% of Brazil’s 
scientific publications. 

The proposal, now before the state’s 
legislature, aims to avoid a 10.4 billion 
reais ($1.9 billion) shortfall in Sao Paulo’s 
2021 budget, caused in large part by the 
COVID-19 pandemic. One provision calls 
for the three major academic institutions— 
the University of Sao Paulo (USP), the Uni- 
versity of Campinas (Unicamp), and Sao 
Paulo State University—to transfer money 
in their long-term reserve accounts to the 
state government. The Sao Paulo Research 
Foundation (FAPESP), a state agency that 
funds research and fellowships, would also 
have to hand over its reserve funds. To- 
gether, researchers estimate, the accounts 
hold more than 1 billion reais, money the 
institutions rely on to weather economic 
challenges and pay for long-term projects. 

The prospect, coming on top of a 
yearslong decline in science funding from 
the federal government, has sparked an 
outcry among researchers. If enacted in 
its current form, the bill “will paralyze 
all scientific activities in the state of Sao 
Paulo,” the Brazilian Academy of Sciences 
predicted in a 17 August letter. The Brazil- 
ian Society for the Advancement of Science 
warned the same day of “irreversible dam- 
age.” As Science went to press, more than 
110,000 people had signed an online peti- 
tion opposing the bill, issued by the Sao 
Paulo Science Academy. 

“We would have to close some [research] 
areas, we would struggle to pay salaries” 
if the proposal becomes law, says Marcelo 
Knobel, Unicamp’s chancellor. “It would be 
an unprecedented situation.” 


sciencemag.org SCIENCE 


PHOTO: © BAYER 


The state government asserted in a state- 
ment, however, that “the fiscal adjustment 
will not paralyze research,” and that “sci- 
ence has the full support” of the govern- 
ment. The bill would only redirect “surplus 
resources” to “remedy a current problem, 
which is the need to pay civil servants, in- 
cluding teachers from these institutions 
themselves, with the sharp drop in revenue 
caused by the pandemic. ... Certainly, it is 
not fair for the poorest population to be 
unassisted with medicines or health care, 
while universities and FAPESP may have 
surplus cash.” 

University and foundation officials say 
that far from being a luxury, the reserve 
funds are vital for covering budget deficits 
during lean years. Each of the institutions 
receives a fixed share of state tax revenues, 
but Brazil’s struggling economy has made 
that income less reliable in recent years. 
Unicamp, for example, has had to dip into 
its reserves to cover operating expenses in 
each of the past 6 years, Knobel says. 

At FAPESP, loss of its reserve could 
threaten funding already committed 
to multiyear projects. The foundation 
typically supports projects that last 2 to 
11 years but does not release the total fund- 
ing up front. Instead the money is paid 
in installments. As a result, claims that 
money in the foundation’s reserve fund “is 
a surplus” are “not true,” says Mayana Zatz, 
a USP geneticist. Much of that money “is 
already committed.” 

Zatz and others say FAPESP’s reserve 
has also enabled it to respond rapidly to 
requests for funding to study emerging is- 
sues, such as the COVID-19 pandemic and 
the earlier Zika virus outbreak. And the 
foundation has been a lifeline for research- 
ers in Sao Paulo in recent years as the 
federal government has reduced spending 
on science, cutting research budgets by 
roughly half since 2014. 

Last week, the chancellors of the three 
state universities met with members of the 
government to discuss the proposal, but 
they failed to win removal of the provision. 
Now, researchers are hoping their public 
campaign will persuade the assembly to re- 
ject it. But they are uneasy. “Unfortunately, 
most [lawmakers] have little understand- 
ing of how science works, so it’s worrying,” 
says Ohara Augusto, a chemist at USP and 
the coordinator of a large FAPESP research 
project. She fears passage of the bill could 
end decades of progress on building sci- 
entific excellence in Brazil. “Things were 
going well. ... They weren’t perfect but 
we had hope,” she says. “But now this bill 
could bury our hope.” 


Ignacio Amigo is a journalist in Madrid. 
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A solution of clothianidin and 
another insecticide is sprayed on 
the walls of a home in Rwanda, 


Malaria fighters’ latest chemical 
weapon may not last long 


Clothianidin-resistant mosquitoes already seen in Cameroon 


By Munyaradzi Makoni 


n insecticide about to be widely de- 

ployed inside African homes to com- 

bat malaria-carrying mosquitoes is 

already losing its punch. Two years 

ago, the World Health Organization 

(WHO) gave the green light for clo- 
thianidin, long used in agriculture to kill crop 
pests, to be added to the current mainstays 
of indoor mosquito control, which are losing 
their effectiveness as the insects develop re- 
sistance. Since then, many African countries 
have been laying plans to spray walls with the 
pesticide, which represents the first new class 
of chemicals adopted for such home use in 
decades. They’ve also been looking anxiously 
for pre-existing resistance. 

Now, scientists at Cameroon’s Centre for 
Research in Infectious Diseases have found 
it. They recently sampled mosquito species, 
including two key malaria carriers, from 
rural and urban areas around Yaoundé, the 
capital. In one standard assay, exposure 
to clothianidin for 1 hour killed 100% of 
Anopheles coluzzii. But in some A. gambiae 
samples as many as 55% of the mosquitoes 
survived, the group reported in an online 
preprint last month. 

Corine Ngufor, a medical entomologist at 
the London School of Hygiene & Tropical 
Medicine, says this appears to be the first 
report of clear resistance to clothianidin in 
malaria-carrying insects. “It may spread very 
quickly and make this new class of insecti- 
cide almost useless for malaria vector con- 
trol within a few years,” she warns. 

Colince Kamdem, who led the study, says 
agricultural use of neonicotinoids—the class 
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of chemicals to which clothianidin belongs— 
likely drove the emergence of the resistant 
mosquito strains. “WHO would never have 
recommended this insecticide if such data 
were available,’ he contends. 

Bed nets coated with long-lasting insecti- 
cides and indoor spraying have helped halve 
malaria mortality and morbidity in the past 
2 decades. These programs use four classes 
of insecticides but rely most on pyrethroids, 
which are cheap and considered safe around 
people, Kamdem says. 

To combat the rise of pyrethroid-resistant 
mosquitoes, WHO added clothianidin to its 
“prequalified” list of chemicals acceptable 
for indoor spraying (and potentially nets). 
Neonicotinoids have become controversial 
because of their impacts on pollinators; Eu- 
rope has banned their use in agriculture. But 
farms in Africa still heavily use them. In ag- 
ricultural areas, Kamdem says, pesticide resi- 
dues contaminate standing water that serves 
as breeding sites for mosquito larvae, favor- 
ing the evolution of neonicotinoid resistance. 

WHO has not reviewed the Cameroon 
study because it has not yet been published 
in a peer-reviewed journal, says Deusdedit 
Mubangizi, who coordinates the agency’s 
“prequalification” assessments, including 
those of insecticides used for mosquito con- 
trol. But he thinks the chemical could still 
be an asset against malaria. “Resistance to 
clothianidin is much less prevalent than to 
other alternative insecticides in current use,” 
he says. How long that will last is the great 
unknown—and concern. 


Munyaradzi Makoni is a journalist based 
in Cape Town, South Africa. 
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An ecosystem goes topsy-turvy 
as a tiny fish takes over 


Predator-prey reversal has dramatically altered the 
underwater ecosystem along the Baltic Coast archipelago 


By Elizabeth Pennisi 


o bigger than a minnow, the three- 
spine stickleback may seem a puny 
player in the underwater world. But 
along the European coastline of the 
Baltic Sea, it has edged out its own 
predators—toothy pike and perch, 
fish that grow longer than your forearm. 
Records dating back 40 years show how the 
flamboyant little stickleback has shifted 
the ecosystem, thwarting efforts to restore 
the larger species favored by human fish- 
ers. “A little pelagic fish that many people 
ignore is having a dramatic ecological 


impact,” says Brad deYoung, an oceano- 
grapher at Memorial University who was 
not involved with the work. 

Ecologists say what has happened in the 
Baltic is a dramatic example of a predator- 
prey reversal, in which two species trade 
places on the food chain, drastically alter- 
ing the rest of the ecosystem. “It shows you 
really need to understand not just who 
eats who, but who might eat who to prop- 
erly manage [fish stocks],’ deYoung says. 

Johan Ekl6f grew up along Sweden’s Bal- 
tic coast and fondly remembers catching 
plentiful Eurasian perch (Perca fluviatilis). 
Later, as an ecologist at Stockholm Univer- 


With a mouth big enough to eat baby pike and perch, 
this stickleback species has turned on its predators. 


sity, Eklof and his colleagues noted that the 
three-spine stickleback (Gasterosteus acu- 
leatus) seemed more and more common in 
coastal waters. To find out what was going 
on, the researchers unearthed 13,000 sur- 
veys of fish done between 1979 and 2017 
by scientists and fisheries managers along 
1200 kilometers of the western coast of the 
Baltic Sea. “This paper is a good example of 
using past data, which can sometimes seem 
dull, to explore a problem that cannot be 
addressed any other way,” deYoung says. 

In the 1980s, Ekl6f and colleagues found, 
sticklebacks outnumbered not just perch, 
but also Northern pike (Esox Lucius), at the 
seaward edges of the many islands and shal- 
low bays along the Baltic coast. That’s not 
surprising—pike and perch are freshwater 
fish, able to survive in the ocean only where 
river outflows lower salinity. Those fish pre- 
vailed in the fresher waters 8 kilometers 
closer to shore. But in the 1990s, stickle- 
backs began to outnumber their predators 
closer to land, their dominance spreading 
toward more protected bays and inland 
waters. By 2014, sticklebacks reigned a full 
21 kilometers landward from the archipela- 
go’s edge, Eklof and his colleagues reported 
last week in Communications Biology. 

The sticklebacks themselves probably 
didn’t initiate their predators’ decline. In- 
stead, complex ecological factors appear to 
have first touched off the shift. Beginning 
in the 1990s, gray seals became more com- 
mon, thanks to better water quality and an 
end to bounty hunting. The seals, along 
with cormorants, began to eat more pike 
and perch. Meanwhile, sticklebacks were 
thriving in the rapidly warming seas. And 
overfishing of cod and large herring meant 
stickleback had fewer predators. 


Stickleback surge 


Predatory fish (orange) such as pike and perch once dominated the waters around the archipelago along the Baltic coast. But sticklebacks (red) have now taken over. 
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As the stickleback grew numerous, they 
became a formidable foe: They eat juvenile 
pike and perch. In earlier studies, co-author 
Ulf Bergstr6m from the Swedish Univer- 
sity of Agricultural Sciences and colleagues 
found both species in the stomachs of stick- 
lebacks. Ekl6f, Bergstr6m, and their col- 
leagues caught and analyzed fish in 32 bays 
and confirmed that where stickleback were 
abundant, juvenile pike and perch were 
scarce. Thus, as stickleback became more 
plentiful in more places, pike and perch had 
even less chance to recover. 

This is not the first time that scientists 
have documented a predator-prey reversal. 
For example, large populations of herring in 
the North Sea likely drove down numbers of 
cod, their predator, by feasting on tiny cod 
juveniles. But such connections have been 
difficult to document. “This result seems re- 
markably clear,” deYoung says. 

The work also stands out because it 
documents such a widespread and last- 
ing ecological shift, adds Steve Carpenter, 
a limnologist at the University of Wiscon- 
sin, Madison. More typically, research- 
ers have observed such shifts in a single 
location, often a lake, showing how domi- 
nance swings back and forth between two 
species as temperature changes or fishing 
becomes more intense, he says. The new 
results “show that regime shifts can spread 
among connected habitats and transform 
an entire coastline rather rapidly.” 

The stickleback surge is triggering other 
ecosystem impacts. The fish eat snails 
and crustaceans that previously kept 
green algae in check, favoring the return 
of algal blooms that had been declining 
in these waters thanks to pollution con- 
trol measures. 

The work “clearly shows that the [dis- 
appearance] of larger predators can cause 
cascading effects all the way down to algae, 
and that these changes can unfold over vast 
spatial scales like falling dominoes,” says 
Boris Worm, a marine biologist at Dalhou- 
sie University. Worm worked in the Baltic 
Sea as a Ph.D. student, and he mourns the 
change, calling it “a slow-motion disaster 
through the Baltic Sea.” 

Ekl6f and others are now considering 
how to bring back pike and perch, per- 
haps by locally fishing out stickleback or 
stocking bays with juvenile pike and perch 
too big for stickleback to eat. For now, the 
lesson is clear. “The world is changing at 
a very fast rate and ecosystems are shift- 
ing, most times to less desirable states,” 
says Julian Torres Dowdall, an evolution- 
ary biologist at the University of Konstanz. 
How politicians and managers respond to 
the result of this study “is important to 
our planet.” 
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POLICY 


Cannabis research data reveals 
a focus on harms of the drug 


Funding for therapies grows slowly, new analysis finds 


By Cathleen O’Grady 


or research funders, marijuana is still 

more vice than virtue. A new analysis 

of cannabis research funding in the 

United States, Canada, and the United 

Kingdom has found that $1.56 billion 

was directed to the topic between 2000 
and 2018—with about half of the money 
spent on understanding the potential harms 
of the recreational drug. Just over $1 bil- 
lion came from the biggest funder, the U.S. 
National Institute on Drug Abuse (NIDA), 
which doled out more money for research on 
cannabis misuse and its negative effects than 
for studies of cannabis and cannabis-derived 
chemicals as a therapeutic drug. Total Ca- 
nadian and U.K. spending was far lower, at 
$32 million and $40 million, respectively; 
U.K. funding also emphasized harms. 

The data confirm “word on the street” 
that government grants go to research that 
focuses on harms, says Daniela Vergara, who 
studies cannabis genomics at the University 
of Colorado, Boulder. Still, research on the 
medical potential of cannabis is growing 
alongside overall cannabis research funding, 
which rose from about $30 million in 2000 to 
more than $143 million in 2018. 

NIDA’s traditional focus is on addic- 
tion and the adverse effects of drugs, 
a spokesperson says. However, recently, 
the agency has begun to explore the thera- 
peutic potential of cannabinoids to treat 


addiction, the spokesperson adds. 

The analysis is based on data assembled by 
Jim Hudson, a medical research consultant 
who collected publicly available grant data 
from 50 funding agencies and charities. He 
classified the grants into categories based 
on keywords. It’s the first attempt to analyze 
cannabis grant data from a wide range of 
sources, says Lee Hannah, a cannabis policy 
researcher at Wright State University. 

The analysis, posted last week on Hud- 
son’s website, hints at the legal hurdles to the 
research. In 2018, the $34 million spent on 
therapeutics went mostly to research on can- 
nabinoids—chemicals in cannabis—rather 
than the cannabis plant itself. Vergara says 
practicality is one reason: It’s often easier for 
researchers to work with these isolated com- 
pounds. But it’s also hard to get permission to 
use the whole marijuana plant. The only legal 
U.S. producer of research cannabis is the Uni- 
versity of Mississippi, which grows cannabis 
that is less potent than recreational pot. 

The limited funding for therapeutic re- 
search is part of a vicious circle, says Daniel 
Mallinson, a cannabis policy researcher at 
Pennsylvania State University, Harrisburg. 
Research is restricted because the U.S. Drug 
Enforcement Administration considers mari- 
juana to have high potential for abuse and 
does not see evidence for medical benefits. 
But the evidence needed to show medical 
benefits is hard to get because the research is 
restricted, Mallinson says. 


Higher purposes 


In 2018, research on the potential harms of cannabis received 
about twice as much funding as research on cannabis and 
cannabinoid therapeutics, according to an analysis of grants 


from 50 public agency and charity funders. 
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THE CARBON VAULT 


Industrial waste can combat climate change 
by turning carbon dioxide into stone 


n July 2019, Gregory Dipple, a geolo- 

gist at the University of British Colum- 

bia, Vancouver, hopped on a 119-seat 

charter flight in Yellowknife, Canada, 

and flew 280 kilometers northeast to 

the Gahcho Kué diamond mine, just 

south of the Arctic Circle. Gahcho 

Kué, which means “place of the big 

rabbits” in the Dénésuliné language 

of the region’s native Dené or Chipewyan 

people, is an expansive open pit mine 

ringed by sky-blue lakes. There, the mining 

company De Beers unearths some 4 mil- 

lion carats’ worth of diamonds annually. 

But Dipple and two students weren’t there 

for gems. Rather, they were looking to use 

the mine’s crushed rock waste as a vault to 
lock up carbon dioxide (CO,) for eternity. 

At Gahcho Kué, Dipple’s team bubbled 

a mix of CO, and nitrogen gas simulating 
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By Robert F. Service 


diesel exhaust through a grayish green 
slurry of crushed mine waste in water. 
Over 2 days, the slurry acquired a slight 
rusty hue—evidence that its iron was oxi- 
dizing while its magnesium and calcium 
were sucking up CO, and turning it into 
to carbon-based minerals. The CO,-hungry 
waste from the diamond mine is an exotic 
deep-earth rock, shot up to the surface 
in the volcanic eruptions that bring up 
diamonds. But a wide array of rock and 
mudlike wastes from mining, cement and 
aluminum production, coal burning, and 
other large-scale industrial processes share 
a similar affinity for the greenhouse gas. 
Known as alkaline solid wastes, these ma- 
terials have a high pH, which causes them 
to react with CO,, a mild acid. And unlike 
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other schemes for drawing excess CO, from 
the atmosphere, these reactive rocks can 
both capture the gas and store it, locked 
away permanently in a solid mineral. 

“The potential is real,’ Dipple says. “It 
will make an important contribution to 
lowering CO,,.” 

If he and others can make the scheme 
practical, it could address two environ- 
mental problems at once. Today, mines 
and industry generate some 2 billion tons 
of alkaline solid wastes every year, and 
more than 90 billion tons are stored be- 
hind fragile dams and heaped in waste 
piles, a threat to people and ecosystems 
(Science, 21 August, p. 894). In 2010, for ex- 
ample, a dam failure in Hungary released a 
2-meter-high wall of red mud—an alkaline 
waste from aluminum production—that 
killed 10 people and buried villages. And 
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caustic leachates from mountains of steel 
slag waste have wiped out fish populations 
in Pennsylvania and the United Kingdom. 

Reacting these wastes with CO, from the 
air could make them safer by solidifying 
them—and at the same time help the world 
avert climate disaster. In the 2015 Paris cli- 
mate agreement, most of the world’s coun- 
tries resolved to limit climate warming to 
below 2°C. For that to happen, the Inter- 
governmental Panel on Climate Change 
(IPCC) has determined, cutting greenhouse 
gas emissions won’t be enough. Countries 
will also need to employ “negative emis- 
sions technologies” (NETs) to pull as much 
as 10 billion tons (gigatons) of CO, out 
of the atmosphere every year toward the 
end of this century. Possible NETs include 
planting vast forests, which suck carbon 
out of the air as they grow; chemically ab- 
sorbing CO, from the air or power plant 
exhaust and pumping it underground; and 
growing grasses or shrubs, burning them 
for energy, and capturing and storing the 
CO, (Science, 16 February 2018, p. 733). 

But underground storage chambers can 
leak, and forests can burn. Mineralization 
is more permanent: Carbon-based miner- 
als, or carbonates, are among the most 
stable on Earth, adds Siobhan “Sasha” 
Wilson, a biogeochemist at the University 
of Alberta, Edmonton. “It’s a really robust 
place to store CO,,” she says. 
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through a slurry of rock waste in a tube (right) to test how to lock away carbon from diesel exhaust. 


And suitable rock waste is plentiful. Start 
with ultramafic wastes, the calcium- and 
magnesium-rich rock in which diamonds, 
along with metals such as nickel, platinum, 
and palladium are found. A 2019 report on 
NETs by the U.S. National Academy of Sci- 
ences (NAS) described CO, storage in ultra- 
mafic mine wastes as “low-hanging fruit.” 
Today, some 419 million tons of this and less 
alkaline “mafic” wastes are produced annu- 
ally. If fully carbonated, they could lock up 
175 million tons of atmospheric CO, per year. 
Then there are the alkaline wastes from alu- 
minum, iron, steel, and cement production, 
which could bring the total up to at least 
310 million tons—and by some estimates 
more than 4 gigatons (GTs)—of CO, trapped 


Trash to treasure 

ndustrial wastes could lock away 310 million tons 
of carbon dioxide every year. Each category of waste 
could mineralize the percentages shown. 


unicipal l Cement 
solid waste waste 
2.4% 16.3% 
ron and Paper 
steel industry 
slag waste 
43.5% 11.4% 
Coal Mining 
combustion waste 
12.3% - 13.5% 
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each year. The somewhat less alkaline basalt 
rock powder generated by coal production 
could sequester another 2 GTs per year, Phil 
Renforth of Heriot-Watt University and his 
colleagues have calculated—meaning alka- 
line wastes could in principle provide more 
than half of the negative emissions that 
IPCC called for. 

But there are major hurdles. Govern- 
ments will need to offer incentives for min- 
eralization on the massive scale needed to 
make a dent in atmospheric carbon. And 
engineers will need to figure out how to 
harness the wastes while preventing the 
release of heavy metals and radioactivity 
locked in the material. Still, “Alkaline 
wastes have tremendous potential,” says 
Liang-Shih Fan, a chemical engineer at 
Ohio State University, Columbus. “It’s a po- 
tential one should not overlook.” 


THE NOTION of storing CO, in minerals isn’t 
new. Plans to capture the gas from the air 
or power plant exhaust often call for inject- 
ing it into underground rock formations 
that, like mine waste, react to form carbon- 
ates. And certain rocks naturally capture 
CO, in a process known as weathering. In 
Oman, vast ridges of a mineral called peri- 
dotite mineralize CO, from the air, forming 
white veins resembling marbling in steak. 
Similar smaller formations dot the globe. 
Mine wastes behave the same way. In 
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At a platinum and palladium mine in Montana, a technician stirs a slurry of waste rock, some of the 2 billion tons of alkaline waste generated each year. 


2014, Wilson and colleagues analyzed mine 
tailings from the Mount Keith nickel mine 
in Western Australia and found that the 
mine’s 11 million tons of tailings produced 
each year spontaneously react with CO.,, 
locking up about 40,000 tons of the gas. 
That’s equivalent to about 11% of the CO, 
output from the mine’s operations. 

Still, weathering is slow, and most al- 
kaline wastes wind up either buried or 
submerged in ponds, and thus aren’t ex- 
posed to air. “It’s a matter of getting those 
reactions to happen at a faster rate,’ says 
Alison Shaw, a geochemist with Lorax En- 
vironmental Services who heads De Beers’s 
research on mineralizing CO,. 

At Gahcho Kué, Dipple and his students 
tested a way to speed up the process. The 
mine’s tailings include a wet, siltlike slurry 
and dry, sandlike grains. Dipple and his 
students packed a 6-meter-tall column 
with the greenish slurry and sprayed water 
on 1 cubic meter of the sand. With both 
their slurry and dry wastes, they bubbled 
in a mix of gases—10% CO, and 90% 
nitrogen—that matched the exhaust from 
the local diesel power plant that powers 
the mine. 

The waste soaked up CO, for as long as 
44 hours, they found, converting it into 
minerals. The newly made magnesium car- 
bonate minerals acted like glue, solidifying 
the previously free-flowing tailings, much 
like sand turned to sandstone. Most im- 
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portant, the waste took up CO, 200 times 
faster than it did through natural weather- 
ing, Dipple says. 

This summer, he was set to return to 
Gahcho Kué to scale up the tests and use 
actual diesel exhaust. But those tests are on 
hold because of the coronavirus pandemic. 

De Beers has funded other projects 
around the world, Shaw says. For example, 
Wilson’s team is exploring whether dilute 
acids speed up weathering. Lab studies sug- 
gest the acids could leach magnesium out 
of mine waste, making it available to react 


Bricks and mortar 

Small companies in a nascent industry are using 
carbon mineralization to capture carbon dioxide (CO,) 
in construction materials. 


COMPANY PRODUCT 
CarbonCure Technologies Concrete 
Solidia Technologies Concrete 
CO,Concrete Concrete 
Carbicrete Concrete 


Cambridge Carbon Capture _ Fire-retardant materials 


Mineral Carbonation Cement and plasterboard 


International 

0.C.0. Technology Construction aggregate 
Blue Planet Construction aggregate 
Orbix Construction aggregate 
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with CO,. Another project, led by Gordon 
Southam of the University of Queensland, 
St. Lucia, is adding cyanobacteria to the 
mix. These photosynthetic bacteria capture 
CO, from the atmosphere, and lab studies 
have shown they speed carbon mineraliza- 
tion. If these efforts work, Shaw says, they 
could repair mines’ reputation as environ- 
mental blights, making them part of a solu- 
tion to climate change. Anglo American, De 
Beers’s parent company, has announced it 
wants to harness alkaline wastes to create 
the first carbon-neutral mine by 2040. 


DIAMOND MINES aren’t the only places 
where such studies are underway; another 
is the Woodsreef chrysotile mine in New 
South Wales in Australia. (Chrysotile is a 
form of asbestos that is still widely used 
in building materials in some parts of the 
world.) Wilson and her colleagues sprayed 
the mine’s ultramafic rock tailings with 
dilute sulfuric acid, causing magnesium 
to leach out. The alkaline tailings then 
neutralized the acid and locked up CO, 
that was bubbled through, as much as 
100 times faster than normal weathering. 
Jennifer Wilcox, a chemical engineer at 
Worcester Polytechnic Institute, and her 
colleagues are pursuing a related strategy 
at the Stillwater nickel mine in Montana. 
“The tailings are not particularly reactive,” 
she says. But CO, is mildly acidic; bub- 
bling it through the tailings helps release 
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their metals and boosts their affinity for 
CO,. She and her colleagues are exploring 
whether adding compounds called oxalates 
will speed this process further by weaken- 
ing chemical bonds in the tailings. And 
they are trying to encourage the growth of 
CO,-hungry magnesium carbonate crystals 
by dispersing tiny crystallites of a mineral 
in the tailings. The crystallites, Wilcox 
says, are “like a blueprint for making more 
of what you want.” 

CO, mineralization could help remedi- 
ate environmental problems that min- 
ing creates, such as the release of heavy 
metals from pulverized rock. On 1 March 
in Economic Geology, Wilson and her col- 
leagues reported that techniques that 
accelerate weathering, such as 
adding acid, effectively trap heavy 
metals inside newly formed car- 
bonate minerals, keeping them 
out of groundwater. Other teams 
have shown that the carbonates 
can also trap hazardous residual 
asbestos fibers in chrysotile mine 
tailings. “You can lock away just 
about anything,” Wilson says. 

Other industrial wastes, such 
as red mud from aluminum pro- 
duction and “slags” leftover from 
making steel and iron, also harbor 
plenty of chemical reactivity to bind 
and store CO,. However, accord- 
ing to NAS, fully carbonating these 
wastes could require building costly 
plants to speed the reactions. 

The rock dust created by pulver- 
izing basalt rock, which is already 
mined for construction aggregate, 
could do the job more cheaply, ac- 
cording to Renforth’s team. The 
researchers suggest adding this dust to ag- 
ricultural soils around the world, where it 
would be exposed to the air, could capture 
up to 2 GTs of CO, per year. The basalt dust 
could also fortify soils with nutrients, such 
as potassium and zinc, depleted by inten- 
sive agriculture. And as a bonus, they say, 
the dust would react with CO,, generat- 
ing bicarbonate, much of which over time 
would flow through rivers to the sea; once 
there, bicarbonate, which is alkaline, could 
counteract ocean acidification. 

In another environmental plus, the NAS 
panel said, carbonated wastes of all kinds 
could serve as raw material for concrete 
and road aggregate. The report noted that 
replacing 10% of building materials with 
carbonated minerals could reduce CO, 
emissions by 1.6 GTs per year by lowering 
emissions from cement production. Nu- 
merous companies around the globe have 
already jumped into the field to make and 
sell the new materials (see table, p. 1158). 
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YET EVEN IF LARGE-SCALE mineralization 
works, scaling it up will carry daunting 
costs, both financial and environmental. 
Quarrying, crushing, and grinding ultra- 
mafic rocks would cost only about $10 per 
ton of CO, absorbed, Wilson and her team 
estimate. Moving the rock, stirring it, and 
other steps to speed mineralization would 
likely boost the cost to between $55 and 
$500 per ton of stored CO,. That’s similar 
to the cost of more traditional direct air 
capture using liquid amines, which has 
already gained widespread attention and 
commercial interest. 

But it would take mind-boggling quan- 
tities of rock to budge global CO, levels. 
According to a report published online on 


Re 


In Oman, carbon dioxide reacted naturally with rock, forming wh 
streaks of carbonate. 


6 May in Chemical Geology led by Peter 
Kelemen, a geologist at Columbia Univer- 
sity, consuming 1 GT of CO, would require 
10 to 100 GTs of tailings—5 to 50 cubic 
kilometers of material. That’s enough to 
bury Washington, D.C., 30 to 300 meters 
deep—but it could only capture roughly 
one-fortieth of the CO, humans spew into 
the atmosphere every year. “We don’t make 
anything on the scale that we make CO,,” 
says Klaus Lackner, a physicist who runs a 
center at Arizona State University, Tempe, 
that is evaluating all types of NETs. 

All that mining, grinding, and transpor- 
tation would itself generate CO,, unless 
it were powered with renewable energy. 
And if even a tiny bit of the heavy met- 
als from the pulverized rock leached out, 
mountains of rock waste could risk con- 
taminating groundwater. “Not all rocks 
are environmentally friendly if you spread 
them all over,” Lackner says. 

“We're trying to understand the trade- 
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offs,” Dipple says. “The size of the problem is 
tens of billions of tons of CO, per year. The 
only way to deal with that is to create an in- 
dustry on the scale of the oil and gas indus- 
try. There is more than enough rock to do 
that. The question is how do you do that in 
a way that is a net environmental benefit?” 

A compromise requiring less land but 
also less assurance that carbon will remain 
locked up could come from a hybrid be- 
tween carbon mineralization and direct air 
capture. In Chemical Geology as well as a 
recent patent, Kelemen and his colleagues 
propose using a mineral called magnesite 
that, when heated, gives off pure CO,, which 
could be captured in tanks and pumped 
underground. That reaction would leave 
Magnesium oxide powder, which 
when spread thin would rapidly re- 
act with CO, from the atmosphere, 
re-forming magnesite, complet- 
ing a cycle that could be repeated 
over and over. Kelemen and his 
colleagues calculate that mining 
and processing 2 GTs of magne- 
site would enable capture and 
injection underground of 1 GT of 
CO, every year. The cost would be 
between $24 and $98 per ton of 
CO,, which is less than traditional 
direct air capture methods cost. 
And it would likely require only 
4500 to 6100 square kilometers of 
land, or about four times the size 
of Gahcho Kué. 

Looming just as large as cost 
is the question of how to entice 
companies to build a vast carbon- 
capture industry. Existing govern- 
ment incentives to reduce carbon 
are little help. The United States 
offers a tax credit of $50 per ton of CO, 
that gets stored underground. California’s 
low carbon fuel standard also rewards 
companies that sequester carbon. And car- 
bon taxes in place in 29 countries encour- 
age carbon reductions. But none of those 
incentives rewards mineralization as a way 
to lower atmospheric carbon. 

There’s reason to hope that could change, 
says Noah Deich, executive director of 
Carbon180, a nonprofit firm that is pushing 
Congress to increase funding and incen- 
tives for NETs, including mineralization. If 
regulators verified mines and other alka- 
line waste producers as CO, sequestration 
sites, Lackner adds, incentives would sky- 
rocket, companies could claim tax benefits, 
and industry might start to tackle climate 
change on the grand scale that’s necessary. 

To avert the worst damage from climate 
change, Lackner says, “we need to throw 
everything we can at it.” Including, per- 
haps, a lot of rocks. & 
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High-precision molecular measurement 


Spectroscopy of hydrogen deuteride ions provides the proton-to-electron mass ratio 


By Masaki Hori 


he hydrogen molecular ion (H,* = 

p* + p* + e-) is the simplest mol- 

ecule with two protons bound by an 

electron. Historically, it was the first 

molecule to be studied by using quan- 

tum mechanics, and it remains on the 
short list of experimentally accessible mol- 
ecules for which a truly precise theoretical 
understanding is possible. However, several 
characteristics make precision optical spec- 
troscopy of H,* a formidable challenge in 
laboratory experiments. Hydrogen deuteride 
molecular ion (HD* = p* + d* + e-), in which 
one of the protons of H,* is replaced by a 
deuteron (J—3), has an asymmetric dipolar 
structure that allows numerous vibrational- 
rotational transitions. These “rovibrational” 
transitions (v,,) have exceptionally narrow 
relative widths of less than 10% and occur 
at much higher rates when compared to the 
even narrower H,* transitions. On page 1238 
of this issue, Patra et al. report two frequen- 
cies with a precision of 2.9 parts per trillion 


Max-Planck-Institut far Quantenoptik, 
Hans-Kopfermann-Strasse 1, 85748 Garching, Germany. 
Email: masaki.hori@mpq.mpg.de 
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and determine the mass ratio between the 
proton and electron (2). 

Quantum electrodynamics (QED) is the 
relativistic quantum field theory that de- 
scribes the electromagnetic interaction, 
which is among the four known funda- 
mental interactions of nature. QED reveals 
the forces that act between the bound elec- 
tron, proton, and deuteron in HD* that 
arise from an infinite series of elementary 


“So high a consistency 
between multiple experiments 
at the forefront of precision 
measurements is unusual.” 


processes of progressively higher complex- 
ity. These involve the exchange of virtual 
photons that exist as transient quantum 
fluctuations of the underlying electromag- 
netic field. The fluctuations can temporar- 
ily transform into pairs of virtual electrons 
and positrons that immediately annihilate 
back into photons. This gives rise to min- 
ute but measurable changes in the struc- 
ture of HD*. Some processes involving 
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multiple virtual particles that cause sub- 
parts-per-billion scale shifts in the HD* 
energies have taken a decade to calculate 
(2—5). Despite these difficulties, QED re- 
mains the most stringently tested part of 
the standard model. 

The authors compared their measured 
HD* frequencies with the results of QED 
calculations. Under the assumption that 
there are no deviations from QED predic- 
tions, the authors determined the proton- 
to-electron mass ratio M_/m, with a pre- 
cision of 21 parts per trillion. This value 
lies within 30 and 350 parts per trillion of 
other experiments that instead measured 
the characteristic motions of a proton 
(6) or a H,* ion (7) confined within the 
magnetic fields of ion traps. The result 
is also in excellent agreement with the 
ratio determined to a similar precision 
by a recent measurement carried out in 
Diisseldorf (3) of several hyperfine com- 
ponents of a HD* rotational transition. 
So high a consistency between multiple 
experiments at the forefront of precision 
measurements is unusual. 

The experiment required samples of 
the reactive HD* ions to be isolated in an 
ultrahigh-vacuum environment and cooled 
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to temperature T ~ 10 mK to minimize the 
Doppler broadenings of the spectral reso- 
nances due to the thermal motions. The 
authors achieved this by first confining a 
cloud of beryllium ions (Be*) in the oscil- 
lating electric field of a radiofrequency ion 
trap. The Be* ions were irradiated with an 
ultraviolet laser beam, so that higher-veloc- 
ity ions would scatter more laser photons. 
This velocity-selective scattering eventu- 
ally cooled an ensemble of ~1000 Be* ions 
into the ordered structure of a so-called 
“Coulomb crystal” (8). The HD* ions were 
suspended in the center of the crystal and 
allowed to thermalize (see the figure). The 
ions were then irradiated with two coun- 
terpropagating laser beams with infrared 
frequencies v, and v, that excited the HD* 
transition when the sum v,+ v, was tuned 
to v.. The motion of each HD* ion in the 
trap was strongly confined within its own 
micrometer-sized volume, which allowed 
the observation of particularly narrow 
spectral lines. 

Although the early pioneers (J) realized 
the potential of HD* experiments to eventu- 
ally determine the physical constants, the 
numerous degrees of freedom in a three- 
body molecule made the theoretical evalua- 
tion vastly complicated. At the time, the HD* 
molecular frequencies were typically calcu- 
lated with parts-per-million scale precision. 
This appeared to limit any determination of 
the proton-to-electron mass ratio to a simi- 
lar precision. Development of computational 
techniques based on variational trial func- 
tions that included the molecular degrees of 
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False-color image of a Coulomb crystal containing 
some 1000 Be’ ions cooled to a temperature of less 
than 10 mK. The long dimension of the ellipsoidal 
crystal is ~1 mm. A small number of HD* molecular 
ions (not visible) are suspended in the darker 
horizontal band at the center of the crystal. 


freedom occurred in the 1980s. These tech- 
niques were used to study muonic molecu- 
lar heavy hydrogen ions [(ddu)* = d*+ d*+ 
u- and (dtu)*= d* + t*+ u-] to estimate some 
of the reaction rates relevant for the possibil- 
ity of energy production by muon-catalyzed 
fusion. The methods were used to calculate 
the transition frequencies of neutral anti- 
protonic helium atoms (pHet = p + He** + 
e-) (4, 5), which eventually allowed the de- 
termination of the antiproton-to-electron 
mass ratio to a precision of 8 parts in 10” (9). 
Advances in the calculations and measure- 
ments of the HD* frequencies (2—4) cumu- 
lated in the 2 parts per 10" determination of 
the M, /m, ratio. 

Several advances in fundamental physics 
could result from these observations. Other 
physical constants such as the Rydberg con- 
stant, the charge radii of protons and deu- 
terons (J0-—13), and the deuteron-to-electron 
mass ratio (4) may eventually be deter- 
mined. The charge radii are especially inter- 
esting, as deviations of up to 4% have been 
reported among the results of a few experi- 
ments (JO-13). Some of these physical con- 
stants until recently could only be precisely 
determined on the basis of either the elegant 
simplicity of a single proton confined in an 
ion trap (6) or two-body systems, such as 
atomic hydrogen (H = p* + e~) (10-12), mu- 
onic hydrogen and deuterium atoms (uH = 
p*+ uw and wD= d*+ u) (3), or hydrogenic 
carbon ions (®C** = °C + e-) (15). Upper 
limits have also been set on phenomena that 
may cause deviations from the predictions 
of QED like the possible existence of a fifth 
fundamental force that may act between the 
constituent particles of HD* ions (3). 
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BIOGEOCHEMISTRY 


Soil age alters 
the global 
silicon cycle 


As rocks undergo prolonged 
chemical weathering, plants 
become more important for 
supplying bioavailable silicon 


By Joanna Carey 


ilicon (Si)—the second most abun- 

dant element in Earth’s crust—relies 

largely on geological factors to con- 

trol its mobilization. Thus, Si cycling 

through Earth’s systems was often 

believed to be buffered from human 
disturbance (7). However, research over the 
past several decades has awakened scien- 
tists to the central role of vegetation in reg- 
ulating Si availability in the biosphere (2, 3). 
It is now beyond doubt that human distur- 
bance affects Si biogeochemistry and its as- 
sociated impact on carbon (C) sequestration 
rates. Attempts to decipher how human ac- 
tivities (namely deforestation and agricul- 
tural expansion) influence Si cycling have 
left scientists to reconcile conflicting data 
on the importance of geochemical versus bi- 
ological controls on Si biogeochemistry (4, 
5). On page 1245 of this issue, de Tombeur et 
al. provide new insights into this debate by 
demonstrating the importance of soil age in 
regulating Si cycling (6). 

The Si and C cycles are intricately linked 
at the global level. On geological time 
scales, the chemical weathering of min- 
eral silicates consumes atmospheric car- 
bon dioxide (CO,), thus regulating Earth’s 
climate (7). On biological time scales, the 
uptake of CO, by Si-requiring microscopic 
phytoplankton known as diatoms accounts 
for roughly half of the photosynthesis that 
occurs in global oceans (7). As such, the 
amount of Si exported from terrestrial 
uplands to marine waters can directly con- 
trol the rate of photosynthetically driven 
CO, uptake (8). 

However, Earth’s biological Si cycle is not 
relegated only to aquatic systems. Terres- 
trial vegetation performs an integral func- 
tion in Si biogeochemistry and provides 
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yet another means by which Si and 
C are linked (2, 9). Like diatoms in 
aquatic systems, land plants con- 
sume large quantities of dissolved 
Si—roughly one-third of the amount 
consumed by diatoms in the oceans 
annually (9). All species of terres- 
trial plants consume Si to some de- 
gree as they transpire and deposit 
it in their tissues, often as siliceous 
structures known as phytoliths (70). 

Chemical weathering is the 
primary means by which Si is 
liberated from rocks and made 
available for uptake by both land 
plants and aquatic organisms (J, 5). 
However, the phytoliths created by 
vegetation are orders of magnitude 
more soluble than mineral silicates 
(11) and thus are major suppliers 
of dissolved Si needed for organ- 
ismal uptake (4). An unsolved goal 
of the scientific community has 
been to define the balance between 
geochemical and biological factors 
that control the supply of dissolved 
Si along the land-ocean continuum. 
Are factors that dictate chemical 
weathering rates, such as lithol- 
ogy, climate, and hydrology (J), the 
main control over the Si mobiliza- 
tion from terrestrial systems? Or 
are land plants, which both stimu- 
late chemical weathering and ab- 
sorb Si within their tissues (3), the 
source and driving force behind Si 
availability and export? 

Across a chronosequence of soil 
weathering stages that span nearly 
the full range of soil ages on Earth, 
de Tombeur et al. discovered that soil 
age accounts for variability in the rela- 
tive contributions of chemical weathering 
and biological processes to the sustain- 
ing of Si availability for biological uptake. 
Specifically, as soils age, the amount of Si 
released by chemical weathering eventu- 
ally decreases as a result of Si losses from 
secondary minerals, whereas the perpetual 
recycling of siliceous plant material be- 
comes the major supplier of bioavailable 
Si over time (see the figure). For example, 
Si release from soil weathering was nearly 
nonexistent in the oldest soils studied by 
de Tombeur et al., which were ~2 million 
years old, dominated by quartz, and de- 
pleted in carbonates, clays, and iron ox- 
ides. However, the vast soil phytolith pool, 
sustained by continued plant production 
and decay, supplied ample Si to plants and 
corresponded to elevated leaf Si concen- 
trations in the mature vegetation in these 
oldest soils (see the figure). Thus, as soils 
age, chemical weathering becomes less im- 
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Silicon, soil, and the biosphere 
Aconceptual model describes the relation between soil age and controls 
on silicon availability for biological uptake. The silicon:aluminum (Si:Al) 
ratio of an amorphous, alkali-reactive Si pool indicates the amount of Si 
available from plant dissolution. 
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portant for supplying Si to the biosphere, 
whereas terrestrial plants play an increas- 
ingly preeminent role. 

Herein lies an additional key relevancy 
nearly hidden from plain view within the 
de Tombeur et al. study: Soil age appears 
to be partially responsible for the extreme 
variability of plant Si concentrations ob- 
served within terrestrial vegetation. For 
decades, scientists have known that the 
range of Si concentrations in plants is the 
widest of any element (0.1 to >10% by dry 
weight) (10), but the mechanisms respon- 
sible for this variability have remained 
unclear. Whereas factors such as stress ex- 
posure (for example, herbivory, desiccation, 
heavy-metal toxicity) and Si accumulation 
mode (passive versus active) influence the 
variability in plant Si concentrations, these 
factors do not fully explain why such large 
differences exist, both within and across 
individual plant species (12, 13). Across the 
chronosequence of soil weathering stages 
studied by de Tombeur e¢ al., mature leaves 
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of the most dominant plant species 
displayed increasing Si concentra- 
tions, which corresponded to the 
elevated Si availability from the 
phytolith-rich older soils (see the 
figure). The enrichment of leaf Si 
concentrations mostly results from 
shifts in plant community compo- 
sition toward plants that accumu- 
late high amounts of Si, but also 
from increasing Si concentrations 
within individual species across 
the chronosequence. Si improves 
overall plant fitness, allowing veg- 
etation to withstand a variety of 
external stressors (J0, 13). Given 
the elevated quantities of Si found 
in some of Earth’s most impor- 
tant agricultural crops (such as 
rice, wheat, and barley), improved 
knowledge of the mechanisms that 
control plant Si concentrations is 
crucial for global food security. 

The new study has implications 
for researchers who wish to more 
fully understand fundamental Si 
cycling and the linkages between Si 
and C. The Si liberated from rocks 
by weathering eventually makes its 
way to the coastal oceans, fueling 
diatom-driven primary produc- 
tion (5, 8). Along that land-ocean 
pathway, terrestrial plants inter- 
cept Si and regulate its availability 
for biological uptake (3, 4). Armed 
with the knowledge that soil age 
influences the relative importance 
of geochemical and biological pro- 
cesses in sustaining Si availability 
in terrestrial systems, scientists 
are now better prepared to elucidate the 
controls on Si mobilization from upland to 
downstream aquatic systems, as well as to 
recognize the drivers of differential plant 
Si uptake. 
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THERMAL ECOLOGY 


Dynamics of death by heat 


Time at high temperature modulates fly mortality in nature 


By Raymond B. Huey’? and Michael R. Kearney” 


t has been known for a century that 
mortality from heat depends not only on 
the exposure temperature but also on 
the duration of exposure (J). Typically, 
higher temperature shortens time to 
death. But predicting heat death in 
nature is challenging because an animal’s 
temperature and stress level—especially 
for small species—can fluctuate markedly 
within days and across seasons. Can risk 
of heat death in fluctuating environments 
be understood only by brute-force experi- 
ments involving all possible temperature 
sequences, or can exposure to a few fixed 
temperatures capture key dynamics of heat 
death? On page 1242 of this issue, Rezende 
et al. (2) extend a recently developed math- 
ematical model (3) and show that fixed- 
temperature experiments can be general- 
ized to dynamic patterns and can predict 
mortality of a fly (Drosophila subobscura) in 
nature across seasons and climate shifts. 

Ecologists have long known that heat 
stress constrains the distributions and abun- 
dances of organisms as well as the spread 
of pests, diseases, and invasive species (4). 
However, the increased intensity and dura- 
tion of heat waves with contemporary cli- 
mate change have stoked renewed interest 
in these issues from conservation and health 
perspectives. With human data, nonlinear 
statistical models can evaluate the impact 
of environmental temperatures on observed 
mortality rates and causes of death (5). But 
with animals in nature, mortality rates are 
usually unknown, and biologists must de- 
velop other approaches to evaluate risks of 
heat mortality (6, 7). 

One simple but widely used approximation 
of risk is the thermal safety margin (TSM), 
which quantifies the temperature difference 
between a threshold measure of an orga- 
nism’s heat tolerance and maximum envi- 
ronmental temperatures (6). Organisms with 
small or especially negative TSMs are judged 
at risk of heat stress (8). 

Critical maximum temperature (CT,,,.) is 
a common and nonlethal index of heat tol- 
erance: An animal is heated until it loses its 
righting response when placed on its back. 
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CT.,,, has been measured for thousands of 
species, but its sensitivity to measurement 
protocols (e.g., fast versus slow heating) has 
sparked debates about its ecological and 
evolutionary relevance (3, 9). Ironically, the 
study highlighted here (2) evolved from an 
attempt to resolve this debate. In a previ- 
ous paper, Rezende and colleagues (3) de- 
veloped the concept of a “thermal tolerance 
landscape,” which is a three-dimensional 
portrayal of survival time as a function of 
constant temperature plus exposure dura- 
tion. As Rezende et al. show here (2), this 
landscape can even help to predict survival 
in dynamic environments. 

The mathematical extension from static 
to dynamic begins by relating survival prob- 
abilities to exposure time, temperature, and 
a functional constant (z) describing sensi- 
tivity to temperature change. Then survival 
rate can be estimated by summing instan- 
taneous survival rates across a temperature 


“Asingle survival function 
successfully describes empirical 
survival probabilities...” 


series. A single survival function successfully 
describes empirical survival probabilities in 
both static and dynamic (at least monotoni- 
cally increasing) thermal exposures. Next, 
Rezende et al. use heat tolerance data for D. 
subobscura and predict that daily mortality 
rates should start rising in spring for cold- 
acclimated flies but not until midsummer 
in warm-acclimated ones. However, their 
empirical estimates of relative abundance in 
central Chile show population crashes in late 
spring through early summer. The crash oc- 
curs somewhat earlier than predicted, which 
might reflect insufficiently warm acclimation 
temperatures. When recent climate warming 
is considered, predicted population crashes 
are accelerated by 1 or 2 months and the sum- 
mer low is protracted. 

Despite the success and power of the 
model, it remains a black box with respect to 
mechanisms of heat death. High heat dena- 
tures enzymes and disrupts cell membranes, 
which likely knock out cellular processes 
that vary idiosyncratically among species 
(10). Even so, Rezende et al. show that their 
simple model adequately captures the dy- 
namic accumulation of damage and its net 


Published by AAAS 


effect on mortality, at least in Drosophila. 

Cellular repair processes may reduce or 
stall heat-related damage (10). Rezende et 
al. do not explicitly model repair dynamics 
but assume that flies heat-stressed by day 
fully recover overnight. Thus, recent “ther- 
mal history” (other than acclimation state) 
is assumed to be unimportant. But heat tol- 
erance in flies varies with thermal history 
and prior stress exposure (17). Whether or- 
ganisms recover overnight depends on the 
stress’s magnitude, nighttime temperatures, 
and whether heat stress occurs on sequen- 
tial days, as in a heat wave (72, 13). Such ef- 
fects need to be studied experimentally and 
modeled dynamically (74). 

The model’s implementations (2) did not 
explicitly account for effects of ontogeny, sex, 
and condition on heat stress or for the possi- 
bility of behavioral evasion in heterogeneous 
thermal environments. Nor did it consider 
correlates of heat stress such as desiccation 
and the energetic consequences of activity 
restriction (7). But the approach here can be 
integrated with existing models of these indi- 
rect consequences (/5). 

The correspondence of mortality predic- 
tions with field observations suggests that 
this model captures real-world phenomena. 
And, perhaps most important, the model 
suggests that relatively low field tempera- 
tures—that is, even those well below CT. — 
can cause substantial mortality and popula- 
tion collapse. Thus, CT.,,-based inferences 
may underestimate the population conse- 
quences of climate change but overestimate 
potential ranges of invasive species. In ad- 
dition, Rezende et al. help to highlight open 
challenges, both theoretical and empirical, 
to our ability to understand and predict 
population mortality and reproduction in 
fluctuating environments. 
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PHYSIOLOGY 


Sexual dimorphism in body clocks 


Sexual dimorphism in chronobiology has implications for the health of our 24-hour society 


By Sean T. Anderson and Garret A. FitzGerald 


ircadian rhythms, or the body clock, 

confer temporal structure on hu- 

man behavior and physiology to 

align homeostatic processes with an- 

ticipated changes in the environment. 

Disruption of these rhythms can influ- 
ence health and well-being. Chronobiological 
research has often failed to consider how 
this temporal organization may be affected 
by sex. The few studies that do consider how 
these rhythms differ between sexes suggest a 
dimorphism that warrants further investiga- 
tion. Recent findings from both humans and 
animal models illustrate how the systems 
that generate circadian rhythms diverge be- 
tween the sexes, which has potential conse- 
quences for health and resilience to changes 
in sleep pattern. 

Circadian rhythms are generated centrally 
by a transcription-translation feedback loop 
in the suprachiasmatic nucleus (SCN) of the 
hypothalamus. The proteins brain and mus- 
cle ARNT-like 1 (BMALI) and circadian loco- 
motor output cycles kaput (CLOCK) form the 
positive arm of the molecular clock. This het- 
erodimer promotes the expression of its own 
repressors period 1 (PER1) and PER2 and 
cryptochrome 1 (CRY1) and CRY2. Inhibition 
of the BMALI-CLOCK heterodimer in turn 
suppresses PERI/2 and CRY1/2, allowing 
BMALI and CLOCK expression levels to rise 
once again. This cycle takes roughly 24 hours, 
forming the core molecular clock. Photic 
(ight) input to the SCN can also induce the 
expression of PER1/2 and CRY1/2 and repre- 
sents the main stimulus that entrains the cir- 
cadian rhythm to the day-night cycle. 

Estrogen and androgen receptors are 
expressed in a sexually dimorphic pattern 
along the central neuronal circuitry regulat- 
ing circadian rhythms and show rhythmic 
expression in peripheral tissues (7) (see the 
figure). Early studies of hamsters placed into 
constant darkness observed shorter free-run- 
ning periods among females, indicating that 
their core molecular clock machinery oscil- 
lates faster than that of males (2). Females 
also showed significantly earlier onset of 
activity and responded differently to stimuli 
that shift the intrinsic timing of the circa- 
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dian clock. It was subsequently reported that 
in female mice, behavioral rhythms were 
more consolidated, had higher amplitudes 
(difference between the peak and the mean 
24-hour activity level), and peaked earlier in 
the day than in males. 

Recently, large-scale collections of remote 
sensing data have enabled the observation of 
natural activity patterns in humans. A study 
of 91,105 participants in the UK Biobank re- 
vealed that males were more likely to have 
low-amplitude behavioral rhythms than fe- 
males (3). This can be due to increased ac- 
tivity at night or decreased activity in the 
daytime, reflecting the conservation of more 
robust oscillations in activity rhythms among 
females. Females also spend more time 
asleep, spend more of their time in slow-wave 
(deep) sleep, and are more resilient to noctur- 
nal disturbances than males (4). 

A study of chronotype, or day-night pref- 
erence, in more than 53,000 individuals 
highlighted how age and sex both substan- 
tially affect the timing of circadian rhythms 
(5). Whereas children are typically morn- 
ing types regardless of sex, after puberty 
males tend to be more evening oriented 
than females, mirroring the findings in 
animal models. The hormonal changes as- 
sociated with menopause add extra com- 
plexity to the aging process for females; 
chronotypes converge during middle age as 
both sexes become more morning oriented. 
New efforts to study rhythmic changes out- 
side the laboratory are characterizing the 
“chronobiome’”—a phenotype incorporating 
the in-depth assessment of thousands of 
time-of-day signals across diverse physio- 
logical readouts that will clarify the array of 
sexually dimorphic rhythms in humans (6). 

Forced desynchrony protocols, in which 
the sleep-wake cycle is desynchronized from 
endogenous circadian rhythms, have also 
been used to examine how key parameters 
of circadian rhythmicity differ between the 
sexes. One such study found that females had 
higher-amplitude rhythms in their perfor- 
mance on cognitive tests that assess working 
memory, attention, effort, mood, and sleepi- 
ness (7). Together with a higher-amplitude 
rhythm in the sleep-promoting hormone 
melatonin, this increased amplitude in cog- 
nitive processing resulted in females experi- 
encing a greater deficit during the biological 
night compared with their peak functioning 
time. Although higher amplitudes (peaks-to- 
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trough ratio) are typically beneficial because 
they allow for greater compartmentalization 
of different homeostatic processes, in this ex- 
ample a higher amplitude may be detrimen- 
tal when females must be awake at night. 

The expansion of sequencing efforts has 
revealed how sexual dimorphism in chrono- 
biology extends into oscillations of the tran- 
scriptome, metabolome, and microbiome. 
In a study of transcriptional and metabolic 
rhythmicity in mice, 71% of liver transcripts 
showed conserved rhythmicity between 
males and females, 9% of genes showed 
rhythmicity only in males, and 16% of genes 
showed rhythmicity only in females (8). A 
further 4% were rhythmically expressed in 
both sexes but differed in their phase or am- 
plitude, including several core clock genes 
that showed higher-amplitude oscillations in 
females. Only 55% of liver metabolites and 
29% of serum metabolites had conserved 
oscillations between sexes. Germ-free mice, 
which have no gut microbiota, had dimin- 
ished sexual dimorphism in their gene ex- 
pression and circadian rhythms. Female mice 
have greater diurnal oscillations in their total 
bacterial load, and genetic disruption of the 
host clock machinery affects gut microbiota 
differentially in male and female mice (9). 
Thus, the gut microbiota can contribute to 
the orchestration of circadian rhythms dur- 
ing development and adulthood; this contri- 
bution is likely to differ between the sexes. 

A clinical study comparing the metabolic 
response to misalignment between sexes 
tracked energy consumption and expendi- 
ture during 8 days on a normal schedule 
and then for 8 days featuring a 12-hour 
phase shift, achieved through an 8-hour 
wake opportunity followed by a 4-hour 
sleep opportunity on the fourth day (0). 
Circadian misalignment increased the hun- 
ger hormone ghrelin and decreased the sa- 
tiety hormone leptin in females, which was 
accompanied by decreased feelings of full- 
ness. Female participants’ carbohydrate oxi- 
dation rate and respiratory quotients also 
dropped after misalignment, whereas their 
energy expenditure and lipid oxidation rate 
increased. Conversely, males showed an in- 
crease in leptin and no change in ghrelin af- 
ter the phase shift. They reported increased 
cravings for energy-dense foods, which is 
inconsistent with their hormone changes, 
and showed no difference in energy utiliza- 
tion. This switch in energy utilization may 
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reflect an adaptive response to misalign- 
ment in females. 

Studies in animal models have shown that 
sex hormones are not required to maintain 
circadian rhythms but can affect their ampli- 
tude and response to photic stimuli. Female 
mice in which the estrogen receptor 1 (Es771) 
gene is deleted show fractured behavioral 
rhythms and blunted phase delays produced 
by light pulses given in the early active phase 
of the day. By contrast, light pulses given 
late in the active phase increase phase ad- 


whereas males took 10 days to adapt (J3). 
This difference disappeared in the absence of 
Liver X Receptor a (Lavra), with male mice in 
which Lxra is deleted showing significantly 
faster entrainment than that of wild types. 
This may be due to the impact of Lara dele- 
tion on corticosterone rhythms, which affect 
reentrainment. It is not well understood how 
sex hormones influence rhythmicity directly 
in the SCN because whole-body deletion of 
the receptors or removal of sex organs oblit- 
erates signaling across the entire organism. 


Sex hormone receptors in the circadian network 

The patterning of estrogen receptors (purple) and androgen receptors (green) throughout the circadian 
network varies between males and females. The suprachiasmatic nucleus (SCN) signals to peripheral organs, 
many of which also have circadian oscillations in sex hormone receptors. Circulating estrogen and testosterone 
are also likely to affect the SCN in a sexually dimorphic, rhythmic fashion. This dimorphism is manifested at the 
behavioral level through higher-amplitude rhythms in female activity patterns compared with that of males. 
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vances compared with that of controls (17). 
This suggests that estrogen consolidates be- 
havioral rhythms in female mice and could 
facilitate the faster entrainment to phase 
shifts observed in female mice. The same 
study showed that Es71 deletion did not af- 
fect phase shifting in males, but that wild- 
type males showed smaller phase shifts than 
those of females when exposed to light early 
in the active phase. 

Gonadectomy in male mice deconsolidates 
behavioral rhythms and enhances the phase 
shift produced by light in the early active but 
not the late active phase. Thus, testosterone 
may restrain the phase-shifting effect of light 
(72). Nonreproductive factors are also perti- 
nent to the sexual dimorphism in circadian 
rhythmicity. For example, female mice en- 
trained to an 8-hour phase shift in 6 days, 


SCIENCE sciencemag.org 


Behavior 


Time 


Sexual dimorphism in the SCN should be 
revisited by using single-cell transcriptomics 
and SCN-targeted deletions of estrogen and 
androgen receptors. 

The repeated pattern of dimorphic rhyth- 
micity observed in humans and animal mod- 
els suggest that these differences are not 
attributable simply to societal pressures on 
either sex. Consistent with the findings that 
female mice show enhanced entrainment to 
phase shifts, studies in rodents have shown 
that females tend to be more resistant to 
genetic and environmental circadian disrup- 
tion. In Clock*/4" mutant mice, in which 
mutation of the Clock protein interferes with 
transcriptional regulation by the BMALI- 
CLOCK heterodimer and leads to lengthen- 
ing of the circadian period, females do not 
develop any detectable cardiac dysfunction 
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until 21 months of age, despite male Clock*!9/4¥ 
mice showing cardiac hypertrophy and dys- 
function after 12 months (74). However, in 
ovariectomized Clock*"/“ mice, cardiometa- 
bolic function was impaired relative to ovari- 
ectomized controls by 8 months of age, high- 
lighting the protective effect of estrogen. 

One possible reason for the resilience to 
circadian disruption in females relates to 
their biological imperative. Resistance to the 
negative consequences of circadian disrup- 
tion coupled with improved sleep, even when 
experiencing nocturnal disturbances, may 
facilitate their adaptation to frequent noc- 
turnal awakenings over a sustained period, 
given their predominant role in nurturing 
offspring. The early-activity chronotypes seen 
in women before menopause also align with 
those in children. 

Circadian rhythms are influenced by sex, 
and this interaction is remolded throughout 
life. In the healthy state, females often show 
higher-amplitude oscillations with an ear- 
lier peak in gene expression. Dimorphism 
can also shape the response to circadian 
misalignment and the downstream conse- 
quences of disruptions to normal rhythms. 
A chronic disruption to human circadian 
rhythms is shiftwork, which is associated 
with cardiometabolic disease and cancer. 
Studies have sought to clarify whether this 
risk is affected by sex (15), but the results are 
constrained by a lack of longitudinal data. 
There are large differences in the rhythmic 
regulation of the liver transcriptome be- 
tween males and females, but it is unknown 
whether other organs show similar differ- 
ences or how faithfully this translates to 
protein expression and function. In humans, 
well-controlled, longitudinal analyses of the 
impact of misalignment will be necessary to 
address the hypothesis that females are more 
resilient than males to the disruption of cir- 
cadian function caused by shiftwork and re- 
peated long-distance travel. 
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PROTEIN DESIGN 


Can proteins be truly designed sans function? 


A new unit of local protein structure can aid in the de novo design of ligand-binding proteins 


By Anna Peacock 


roteins come in a wide range of sizes, 
shapes, and folds and perform a broad 
range of functions. Investigations 
into how and why proteins fold from 
peptide sequences to yield a par- 
ticular structure have continued for 
decades (7) and have inspired efforts to de- 
sign proteins de novo—that is, to rationally 
design structured miniature protein folds 
from first principles. Ultimately, the goal 
is not merely to design specific folds but to 
create proteins that execute functions—ide- 
ally, functions beyond the repertoire found 
in nature. When the desired function in- 
volves binding of small molecules, as is the 
case in many applications, this requirement 
adds an additional level of complexity and 
challenge. On page 1227 of this issue, Polizzi 
and DeGrado (2) have developed a search 
algorithm, Convergent Motifs for Binding 
Sites (COMBS), by which ligand-selective 
binding proteins can be designed truly de 
novo, thereby providing a much-needed tool 
for advancing functional protein design. 
The de novo design of a truly artificial 
protein fold was first reported for TOP7, a 
93-amino acid mixed a/f-fold (3). Since 
then, the de novo design of protein struc- 
ture has made impressive progress through 
enhanced understanding, expansion of 
the experimentally validated structures in 
the Protein Data Bank, greater computing 
power, and affordable access to synthetic 
genes that allow for greater experimental 
validation. Artificial peptides have also 
been designed to assemble into previ- 
ously unknown architectures, including 
a-helical barrels featuring accessible chan- 
nels through their core (4), large spherical 
cages (5), and polyhedral shapes (6). More 
recently, the development of protein fold- 
ing and design online computer games has 
even allowed citizen scientists to design 
previously unexplored protein folds (7). 
Protein function can be achieved by 
the actual structure, such as collagen, 
but more often by the binding of complex 
small-molecule ligands generating sen- 
sors, receptors, or catalysts. A cavity must 
be designed with a size and shape to serve 
as a complementary “lock” for the target 
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De novo protein design 
Polizzi and DeGrado developed a unit of local 
protein structure called a van der Mer (vdM) that 
links ligand group chemical functionality and 
main-chain backbone coordinates to help design 
ligand-binding proteins. 


Selecting targets 
Functional groups on the ligand are identified 
and a target peptide fold selected. 


Selecting vdMs 
Complementary vdMs are sampled and the 
highest- scoring ones is selected. 
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ligand “key” that aligns the target for fa- 
vorable interactions but does not interfere 
with proper folding. The common strategy 
of designing a structure and then introduc- 
ing a function often comes at the expense 
of proper structural folding or stability and 
is often not successful. 

A more appealing approach would be to 
simultaneously design structure and func- 
tion de novo. Previously reported strategies 
tend to position the target ligand relative 
to the interacting atoms of the amino acid 
side chain (8, 9). This approach can gener- 
ate a large number of structures to evalu- 
ate computationally, and can also lead to 
combinations of coordinates and ligand 
rotamers that are not actually observed ex- 
perimentally. Generally, approaches to date 
do not initially achieve strong binding, and 
subsequent rounds of experimental valida- 
tion and redesign are often required. 

Polizzi and DeGrado designed an artifi- 
cial protein fold that shares no sequence 
homology to native proteins and also has 
a binding site selective for a complex small 
molecule—in this case, the blood-thinning 
drug apixaban (see the figure). They de- 
veloped a new unit of protein structure, 
which they call a van der Mer (vdM), that 
directly maps ligand chemical group func- 
tionality (such as carbonyl, carboxylate, 
carboxamide, and amine) to peptide resi- 
due backbone coordinates. Crucially, vdMs 
are generated from close contact with the 
side chain, the main chain, or both. Ligand 
chemical-group locations relate to back- 
bone coordinates and not side chains, so 
vdMs link directly to the protein fold. The 
vdMs were extracted from the experimen- 
tally determined structures deposited in 
the Protein Data Bank, and the incidence 
of the various vdMs across the Protein Data 
Bank was used to score them. Surprisingly, 
only a modest number of vdMs are highly 
prevalent, making it computationally at- 
tractive to adopt this approach. 

Binding sites were engineered into de- 
signed four-helix bundle folds that are 
structurally unrelated to factor Xa, for 
which apixaban is an inhibitor (J0), and 
that do not generally bind to small mol- 
ecules. The authors searched for combina- 
tions of vdMs (favoring those with higher 
scores) that could be superimposed onto 
protein backbone templates and _ that 
presented chemical groups that could be 
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overlaid with those of the target ligand. 
Flexible backbone sequence design was 
used to build the remainder of the se- 
quence. Finally, low-energy, well-packed 
designs were validated by ab initio fold- 
ing of sequences to establish designs that 
retained uncollapsed and preorganized 
binding sites in the absence of the bound 
target ligand. No subsequent downstream 
redesign was needed to enhance structural 
stability, function, or ligand-binding activ- 
ity. This accomplishment represents an 
important step forward for de novo func- 
tional protein design. 

To achieve the full potential of de novo 
protein design, simultaneous design of func- 
tion is required. Unfolded proteins, or those 
with intended or accidental mutations, can 
be nonfunctional, whereas biology’s success- 
ful and intended folds offer function. Given 
that the repertoire of biological function has 
evolved through select evolutionary pres- 
sures, designed proteins that achieve the 
same function are unlikely to offer substan- 
tial advantages, so new functionality beyond 
the scope of biology is the goal. 

Recent developments, including the si- 
multaneous design of protein structure and 
ligand-binding site by Polizzi and DeGrado, 
will provide exciting opportunities in sens- 
ing, light capture and storage, diagnostics, 


“Ligand chemical-group 
locations relate to backbone 
coordinates..., so vdMs 
link directly to the protein fold.” 


therapeutics, and catalysis, among others. 
The protein design community is now poised 
to design functional proteins that can begin 
to address some of the most pressing chal- 
lenges facing society today, including ones in 
energy, health care, and sustainability. New 
protein design algorithms need to be made 
accessible to the nonexpert user, in a similar 
way to the protein design online computer 
games (7), if researchers with new creative 
functions in mind are to realize the full po- 
tential of protein design. 
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CORONAVIRUS 


A molecular trap against 


COVID-19 


Structure-function studies reveal a new receptor 


decoy to block virus entry 


By Brandon J. DeKosky 


he cell surface peptidase angiotensin- 

converting enzyme 2 (ACE2) is the pri- 

mary receptor for the spike (S) fusion 

protein that facilitates cell entry of se- 

vere acute respiratory syndrome coro- 

navirus 2 (SARS-CoV-2). Numerous 
studies are evaluating therapeutic and pre- 
ventive treatments that block S protein in- 
teractions with ACE2 molecules that are ex- 
pressed on host cells. For example, the ACE2 
binding site can be occluded by monoclonal 
antibodies, several of which are rapidly ad- 
vancing in clinical trials. Several vaccines 
undergoing clinical development also in- 
duce antibody responses that block ACE2-S 
protein interactions. On page 1261 of this is- 
sue, Chan e¢ al. (1) perform high-throughput 
mutagenesis and screening to reveal ACE2 
mutations that enhance affinity for S protein, 
providing new insights into the ACE2-S pro- 
tein interaction on which infection critically 
depends. The authors propose a strategy to 
apply engineered recombinant ACE2 variants 
as decoy receptors for coronavirus disease 
2019 (COVID-19). 

The dimeric ACE2 enzyme is a vasopep- 
tidase expressed on the surface of epithelial 
cells in many tissues, including the lung, 
heart, blood vessels, kidneys, and gastroin- 
testinal tract. It has a primary role in reduc- 
ing blood pressure and inflammation as part 
of the renin-angiotensin-aldosterone system. 
ACE2 expression is closely associated with 
the tissue tropism of SARS-CoV-2 infection 
(2). The trimeric S protein comprises two 
subunits, S1 and S2. The S1 subunit contains 
a receptor binding domain (RBD), which 
binds to ACE2. 

In addition to ACE2 binding, a protease 
cleavage of S protein is required for cell entry 
to allow SI to release and reveal the hydro- 
phobic cell fusion peptide of the S2 subunit 
(3). The cleavage between S1 and S2 can be 
accomplished by several different proteases, 
including transmembrane protease serine 
2 (TMPRSS2), which is expressed in select 
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tissues, and cathepsin L, which becomes 
activated in the low-pH endosomal environ- 
ment (4, 5). The role of ACE2 in facilitating 
S1 shedding remains to be determined (6). 
Recent data show that S protein undergoes a 
conformational rearrangement at endosomal 
pH that modifies S trimer interactions and 
rotates the RBD from the “up” to the “down” 
conformation, which also influences ACE2 
binding affinity (7). The predominant reli- 
ance of SARS-CoV-2 on ACE2 for cell entry 
has led to a focus on the development of new 
methods to disrupt ACE2 binding to S protein 
as potential COVID-19 medical interventions. 

Chan et al. performed protein engineering 
studies to understand SARS-CoV-2 RBD and 
ACE2 specificity characteristics and to engi- 
neer high-affinity ACE2 variants that could 
serve as a receptor decoy and compete with 
native ACE2 for binding to the RBD on SARS- 
CoV-2. They demonstrate that a number of 
residues in ACE2 can be further optimized to 
enhance ACE2 affinity for soluble RBD bind- 
ing. The mutations providing enhanced or de- 
creased affinity give important insights about 
ACE2-RBD interactions. One key finding was 
that disrupting the Asn® glycosylation motif 
in ACE2 enhanced the RBD binding affinity, 
by about twofold for the Thr?—>Gln ACE2 
variant. Because glycosylation is a hetero- 
geneous posttranslational modification that 
varies within cells and between cell types, 
this finding implies a potential for nonglyco- 
sylated ACE2 molecules to be more permis- 
sive to infection than fully glycosylated ACE2. 
Chan et al. identified several other mutations 
at the ACE2-RBD interface that reveal more 
structural features of the ACE2-RBD com- 
plex, including for ACE2 residues 27 to 31 at 
the binding interface, and several other mu- 
tations that suggest the influence of longer- 
range protein folding interactions. 

As in Chan et al., comprehensive muta- 
genesis and functional screening are being 
used extensively to interrogate virus-cell and 
antibody-virus interactions related to SARS- 
CoV-2. Chan et al. screened cellular receptor 
variants using mammalian cells, whereas 
another recent study performed a similar 
analysis of the RBD domain of S protein 
with yeast display, revealing structural con- 
straints and affinity landscapes on the viral 
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side of the ACE2-RBD interac- 
tion (8). Mutagenesis and screen- 
ing of antibody proteins have 
been used for decades to eluci- 
date antibody structure-function 
relationships and are now being 
used to improve antibody drug 
discovery against SARS-CoV-2. 
These mutagenesis and screen- 
ing studies are accelerated by 
next-generation sequencing and 
provide rapid, high-throughput 
data on viral fusion interactions, 
along with opportunities for pro- 
tein engineering and therapeutic 
discovery of antiviral vaccines 
and biologics. 

Chan et al. outline a strategy 
to use an enhanced-affinity engi- 
neered ACE2 variant as a receptor #| 
decoy to block S protein on SARS- 
CoV-2. One engineered ACE2 
variant, sACE2,v2.4, showed 
~10-fold enhanced potency for 
preventing infection in vitro (i.e., 
neutralizing the virus) compared 
with wild-type ACE2. sACE2,.v2.4 
showed a median inhibitory con- 
centration (IC,,) neutralization 
potency against SARS-CoV-2 that 
was subnanomolar. The potency 
of dimeric sACE2,v2.4 compared favorably 
to neutralizing monoclonal antibody po- 
tencies—only a small subset of monoclonal 
antibodies also exhibit subnanomolar IC,, 
values. SACE2,.v2.4 also neutralized SARS- 
CoV (which causes SARS), suggesting that 
the engineered ACE2 mutations affect con- 
served interactions among the two related 
coronaviruses that both use ACE2 as a cell- 
entry receptor. Other receptor decoys have 
been pursued as antivirals, including against 
HIV (based on the CD4 host cell receptor) (9) 
and human rhinoviruses [based on the host 
cell receptor intercellular adhesion molecule 
(ICAM)] (10). Receptor decoys have not yet 
led to a clinically approved antiviral medica- 
tion, but some have been demonstrated to be 
safe in human trials and showed efficacy in 
reducing viral titers and symptom severity in 
a controlled prevention and challenge study 
of the common cold (JJ). 

Engineered ACE2 receptor decoys are an- 
other addition to a panoply of exciting new 
strategies to block COVID-19 by disrupting 
ACE2-S protein interactions, where major 
parallel efforts are under way (see the figure). 
RBD-focused vaccines elicit antibodies that 
neutralize SARS-CoV-2 by directly blocking 
ACE2 binding. Recombinant S protein vac- 
cines, whole-virus inactivated vaccines, and 
live-attenuated vaccines also elicit antibodies 
that interrupt binding to ACE2, in addition 
to other viral epitope targets. SARS-CoV-2 
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vaccine delivery strategies vary broadly, and 
numerous vaccines are advancing rapidly 
through clinical trials. Several small mole- 
cules have also been identified to block ACE2, 
including lectins and synthetic peptides de- 
rived from ACE2 (12). Small molecules can 
have key advantages in cost, production, 
stability, distribution, and administration 
compared with biologics. However, the less 
precise mechanisms of action and thus the 
potential for side effects increase clinical 
risk of small-molecule ACE2-S protein bind- 
ing inhibitors. Monoclonal antibodies and 
vaccines possess a very different risk profile 
than small-molecule drugs. Perhaps the high- 
est risk is antibody-dependent enhancement 
(ADE), where antibody Fc interactions can 
promote inflammation in respiratory mucosa 
that causes immunopathology. ADE is often 
associated with poorly neutralizing antibod- 
ies and has been reported for other respira- 
tory vaccines and in prior studies of Middle 9. 
East respiratory syndrome (MERS) and ii 
SARS (73). Fortunately, convalescent plasma 12. 
(which contains antibodies from recovered mh 
COVID-19 patients) therapy has revealed no 
substantial ADE burdens (/4), and potently 
neutralizing monoclonal antibodies are less 
likely to cause ADE. 

As a new biologic therapy, soluble ACE2- 
based receptor decoys have the advantage 
of no associated ADE risks, and recom- 
binant ACE2 has an established clinical 
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safety record for treating pul- 
monary arterial hypertension 
and acute respiratory distress 
(clinical trials NCT01597635 and 
NCT03177603). The major dis- 
advantage for soluble ACE2 as 
a COVID-19 preventive may be 
its relatively short half-life [~10 
hours in prior studies (15)], sug- 
gesting that it may be best suited 
for treating COVID-19. The half- 
life could be increased for pre- 
vention indications by fusing 
them to an immunoglobulin G 
Fe domain. In addition to viral 
neutralization, therapeutic ACE2 
could alleviate COVID-19 symp- 
toms by decreasing inflamma- 
tion and fluid accumulation in 
lung tissue, making engineered 
ACE2 biologics a promising ap- 
proach to treat COVID-19 that 
may synergize with other treat- 
ment modalities. 

New COVID-19 treatments and 
preventions are advancing rap- 
idly, with numerous approaches 
to disrupt ACE2-mediated viral 
entry. Clinical trial results for the 
first generation of therapies will 
likely be announced at an accel- 


erating pace toward the end of 2020, and 
additional structural information regarding 
ACE2 interactions with RBD and clarification 
of viral cell-fusion mechanisms will inspire 
new drugs to disrupt SARS-CoV-2 infection. 
Several challenges remain, including the lo- 
gistical burdens of deploying medical inter- 
ventions to blunt the spread of SARS-CoV-2. 
Methods of blocking ACE2-dependent viral 
entry that build on the growing understand- 
ing of ACE2 interactions may provide some 
of the strategies needed to suppress SARS- 
CoV-2, and other future coronaviruses. 
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The Stepford wife gets smart 


A pair of digital scholars confront the troubling implications 
of feminized household management technologies 


By Miriam E. Sweeney 


fter a long day at work, you arrive 

home to find that your robot vacuum 

has successfully cleaned your floors. 

The slow cooker has sent you a text 

to let you know that the pot roast is 

ready. The thermostat senses your 
movement and automatically adjusts to your 
preferred temperature. “Alexa, turn on living 
room lights,’ you direct, as you settle 
into your favorite chair to relax. 

Today’s smart home still closely re- 
sembles the fantasies of 1950s technofu- 
turism, complete with retrograde gender 
and (hetero)sexual politics. Whereas the 
model of 1950s household economy fea- 
tured a housewife as homemaker and 
manager of the technologically enhanced 
house, today’s smart home outsources 
much of the domestic labor to a legion 
of “smart wives.” These technologies in- 
clude feminized robots, digital voice as- 
sistants, virtual helpers, and other smart 
devices that have been designed to take 
over what Arlie Hochschild coined “the 
second shift” of unpaid household labor. 

In The Smart Wife, Yolande Strengers 
and Jenny Kennedy explore the many 
forms, representations, and roles played 
by smart wife technologies, particularly 
as they perform cleaning, caring, home- 
making, companionship, and _ sexual 
labor in the home. Strengers and Ken- 
nedy use “wife” as both shorthand and a 
metaphor for a specific form of gendered 
labor within the heterosexual marriage 
institution, a benefit historically af- 
forded to men as a precondition for 
pursuing economic opportunities and 
leisure activities outside the home. 

Employing a rich methodological toolkit 
that includes interviews with technology de- 
signers, industry insiders, and heterosexual 
couples who use smart technologies, as well 
as analyses of cultural representations of 
smart wives and their associated advertise- 
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ments, the authors contextualize the smart 
wife as a “technofix” that seeks to address 
changes that have occurred as women have 
entered the paid labor force and gained 
greater legal and social status outside of the 
institution of marriage. Smart wives thus 
provide a lens to explore “social and politi- 
cal agendas about the role of women, wives, 
and heteronormative relationships” in many 
contemporary Western societies. 


Subtle design cues have led many to infer that the social robot 
Pepper, who is technically “gender ambiguous,” is female. 


Most of the book’s chapters are organized 
around a different smart wife character that 
provides entry into gendered functions and 
social contexts. The Jetsons’ robot, Rosie, 
for example, is the prototypical smart wife 
performing housework and cleaning tasks, 
whereas Amazon's Alexa helps the authors 
initiate a conversation about smart wives as 
surveillance technologies, digital homemak- 
ing, and “pleasance’—a “fundamental feeling 
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The Smart Wife: Why Siri, 
Alexa, and Other 

Smart Home Devices 
Need a Feminist Reboot 
Yolande Strengers and 
Jenny Kennedy 

MIT Press, 2020. 320 pp. 


that is hard to define but that people desire to 
experience,” according to the home automa- 
tion company Lutron. The chapter about the 
sexbot Harmony, meanwhile, provokes ques- 
tions about intimacy, sex, and consent. Each 
example is well chosen to facilitate insight- 
ful discussion about the problematic gender 
stereotypes and inequalities built into smart 
wife technologies and to address the poten- 
tial for harm, violence, and continued devalu- 
ation of women and the feminized forms of 
labor posed by such entities. 

The authors acknowledge that their book 
takes a perspective that prioritizes “the 
gender politics of smart technology in 
privileged Western homes,” identifying 
their own positions as advantaged, het- 
erosexual white women and recogniz- 
ing their decision to “gloss over other 
powerful narratives” that emphasize 
race and class. Although I appreci- 
ated the directness of this disclosure, it 
nonetheless seemed like a missed op- 
portunity to integrate the perspectives 
and cultural histories of women of 
color, poor white women, non-Western 
women, and immigrant women, whose 
domestic labor continues to enable 
middle-class white women to partici- 
pate in the paid labor force. 

Similarly, integrating more perspec- 
tives from LGBTQ people might have 
helped to advance the authors’ stated 
goals of queering smart wife technolo- 
gies and developing smart wives that 
promote gender equity and diversity. 

The authors resist absolutes and 
easy conclusions. They recommend a 
“reboot” of smart wife technologies, for 
example, while acknowledging that this 
will require those of us who imagine, 
design, build, and interact with smart 
wives to challenge the norms and pre- 
conceptions that form the basis of our 
interactions with these devices. In a 
provocative call to technology designers, they 
urge us to engage with the complexities in- 
herent in smart wife technologies, which they 
believe represent a potential site of interven- 
tion in the struggle for gender equity. 
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GENETIC ENGINEERING 


Biology’s brave new world 


The promise and perils of synthetic biology take center 
stage in a fast-paced new series 


By Dov Greenbaum 


he first season of the Netflix series 
Biohackers, consisting of six episodes 
released on the streaming platform 

on 20 August, tells a fictional tale 
centered around the _ sociotechno- 
logical movement known as do-it- 
yourself (DIY) biology, in which amateurs, 
professionals, anarchists, and civic-minded 
citizens push the boundaries of mainstream 
biology. The show’s main characters include 
a wealthy biopharmaceutical executive, a 
group of medical students, a number of ste- 
reotypical biohackers making animals glow 
and plants play music, and a community of 
transhumanists intent on modifying their 
bodies for seemingly impractical endeavors. 
Whereas biological experimentation was 
once the sole domain of trained profes- 
sionals in well-stocked and well-funded 
institutional labs, the field has been de- 
mocratized by the emergence of the open- 
source movement, plummeting sequencing 
costs, greater access to reagents and de- 
vices, the proliferation of online resources, 
and the emergence of tools and methodol- 
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ogies that enable nonexperts to genetically 
engineer organisms without years of pro- 
fessional training. [Valid concerns regard- 
ing some of the activities associated with 
the DIY bio community have been voiced 
by the Presidential Commission for the 
Study of Bioethical Issues (1).] 

The show follows Mia Akerlund (played 
by Luna Wedler), a first-year medical stu- 
dent vying for a position at a prestigious 
biopharmaceutical firm headed by cel- 
ebrated professor Tanja Lorenz (Jessica 


goes out of his way to present complex 
science as accurately as possible. In one 
montage, for example, we watch various 
biohackers, some with better aseptic tech- 
nique than others, add reagents to micro- 
centrifuge tubes, load polymerase chain 
reaction machines, and examine gels to as- 
sess whether they have accurately created 
a desired genomic sequence. In another 
scene, a student suffering from a degenera- 
tive disease seeks to develop his own cure 
in a secret lab, where he can work with- 
out burdensome oversight. The student in- 
jects himself with an unknown liquid, his 
purported cure. Here, the show’s dialogue 
surrounding the cure and its antidote (to 
be administered if things go wrong) offers 
insight into how RNA interference thera- 
pies work. 

But the show also serves as a pedagogi- 
cal vehicle to raise many timely and in- 


Schwarz). Akerlund and _ Lorenz teresting ethical, legal, and social 
clearly have some shared history, as Oe eet concerns. From bioluminescent 
well as their own secrets, although _Biohackers — mammals to the collection of genetic 

. ; . Christian Ditter ; or A onl 
viewers are not privy to the details Netflix. 2020 material for clinical trials, the series 
of either at the start of the series. 6 episodes. storyline highlights how cavalierly 


For much of the first episodes, the 
relationship between these two enigmatic 
characters is revealed slowly through both 
flash-forwards and flashbacks. But we 
know that a big reveal is coming; the pro- 
gram’s official description teases a “secret 
so big it could change the fate of humanity.” 
Throughout the season’s six fast-paced 
episodes, the viewer is exposed to technolo- 
gies and techniques that would be familiar 
to many professional scientists. And while 
the time frames of the various experiments 
conducted are often compressed for dra- 
matic effect, Christian Ditter—the show’s 
creator, writer, director, and showrunner— 


Medical student Mia Akerlund (right) meets biohackers pushing the boundaries of mainstream biology. 
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we sometimes approach genomic 
data and genetic engineering. Later epi- 
sodes depict even more egregious exam- 
ples of biohacking, including organisms 
modified to transmit viruses as efficiently 
as possible. At one point, a character sug- 
gests that the ends of her research justify 
the experimental means, even when her 
methods demonstrate a gross disregard for 
test subjects who may suffer as a result. 

The show also offers insight into some 
of the motivations that drive DIY biology 
efforts. For example, in one scene, a con- 
fidant of Akerlund expresses dismay that 
Lorenz is willing to sell a cheaply acquired 
drug to desperate patients for inflated 
prices. Such frustrations are what drive 
many citizens operating outside tradi- 
tional institutions to develop their own 
pharmaceutical solutions. 

It is ironic that Biohackers is set in Ger- 
many, one of the few places where genetic 
engineering experimentation outside of li- 
censed facilities is illegal and can result in 
a fine or even imprisonment (2). Yet, given 
all that transpires in the show, one is left 
with the sense that such measures may 
be justified. 
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Editorial Expression 
of Concern 


On 10 July, Science published the Report 
“Proton transport enabled by a field- 
induced metallic state in a semiconductor 
heterostructure” by Y. Wu et al. (1). After 
publication, we received emails from two 
independent sources alleging that figures 
S9B and S10 in this paper appeared to be 
manipulated copies of figures previously 
published by the same corresponding 
author, Bin Zhu, in two papers (2, 3) that 
reported different fuel cell compositions. 
The allegations were subsequently posted 
to PubPeer (4). While we await clarifica- 
tion from investigations by the authors’ 
institutions, we are notifying readers of our 
concern about the paper’s data integrity. 


H. Holden Thorp 
Editor-in-Chief 
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The dangers of Arctic 
zombie wildfires 


In June, blistering wildfires raged within 
the Arctic Circle for the second successive 
year (1). Posing no direct threats to human 
life or property, Arctic wildfires are usu- 
ally allowed to burn unabated. They can 
then smolder beneath the Arctic Circle’s 
blanket peat through winter and reignite 
during early spring when temperatures 
rise. Scientists attribute this year’s blazes 
in the northern polar region to persistent 
remnants of wildfires from the summer of 
2019 (2). In addition to changing eco- 
systems within the scorched perimeter, 
these so-called Arctic zombie wildfires 

(2) amplify climate warming by releasing 
carbon from soil and affect human health 
around the planet by releasing particu- 
lates into the air (3). To address these fires 
and their interaction with other compo- 
nents of the Earth system, researchers 
must collect more data and update models 
to account for these feedbacks. It is cru- 
cial to understand what conditions cause 
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the fires as well as the effects of the fires 
on the environment. 

The growing regularity of Arctic zom- 
bie wildfires demonstrates the risks of 
compound climate events (4) under global 
warming. These wildfires shed light on 
the domino effects of coinciding, multiple, 
interdependent natural hazards (5) within 
the Arctic Circle, particularly extreme 
drought and persistent heat waves. Climatic 
drivers of wildfires within the Arctic 
Circle—including temperature (6), dry air- 
flow (7), lightning frequency (8), and wind 
speed (9)—are increasing, making wildfire 
recurrence likely (J0). Yet our knowledge 
about the fires is largely limited to the past 
2 years (3). We lack sufficient data about 
the location and size of roasting areas, 
the amount of atmospheric heat-trapping 
greenhouse gas (CO, and CH,) emissions, 
the paths of the smoke plumes, and the sites 
of fire-related black carbon deposition (3). 

To better understand and manage these 
fires, researchers should comprehensively 
assess the cascading risks (4) that lead to 
their ignition and endurance, such as soot 
impacts on snow and ice covers. Theoretical 
wildfire models should be updated to 
explain swelling Arctic blazes and consider 
changes in their environmental drivers 
(such as peat fuel) and climatic drivers 
(such as extreme winds). Researchers 
should identify and assess both direct and 
indirect environmental and socioeconomic 
impacts of the fires and determine the 
global challenges for humanity that are 
likely to result, including the future climate 
risk that compound events related to the 
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fires could induce. Immediate actions and 
sustained efforts at national and interna- 
tional scales are needed to mitigate Arctic 
zombie wildfires through global multidisci- 
plinary collaboration. 
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Support transgender 
scientists post-COVID-19 


The coronavirus disease 2019 (COVID-19) 
pandemic is magnifying existing oppression 
(1-3), sparking discussions among scientists 
about the post-pandemic community we 
want to build. Transgender scientists should 
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be a part of these conversations to ensure 
that their needs are recognized as we strive 
to make science more diverse and inclusive. 

Transgender people face barriers to 
becoming scientific leaders at every 
career stage (4). From primary school 
onward, students are taught science from 
a cis-normative perspective (5, 6), where 
sex and gender are synonymous and 
binary. Alongside bullying and lack of 
family support, this structural denial of 
“non-standard” genders leads to elevated 
school dropout rates (7). In university and 
professional settings, transgender people 
face more harassment and mental health 
issues than their cisgender colleagues 
(4). Hate crimes and discriminatory laws 
often target transgender people (8), and 
they are routinely denied health care (5, 9, 
10), complicating their efforts to maintain 
mental and physical health, which are 
important for success. Gaps in health care 
are likely worsened in medical systems 
overloaded by the pandemic. 

Transgender people are often put at 
a disadvantage by bias in academic and 
professional hiring decisions (1). Less 
likely to have established careers (17), they 
are vulnerable to the funding cuts and 
unemployment arising from the pandemic. 
Despite these inequalities, affirmative 
policies rarely include transgender as a 
minority identity. The disruptions caused by 
COVID-19 have likely burdened transgender 
scientists with an outsized share of the pov- 
erty, disease, and exclusion from science. 

This moment is a timely opportunity to 
build a broader path to welcome all gender 
identities. All scientists should respect 
chosen names and pronouns, speak out 
against anti-transgender policies and laws, 
and challenge perspectives in scientific 
culture that erase transgender experiences. 
Institutions should develop transgender- 
inclusive practices (72), such as creating 
inclusive name change policies, allocating 
funds to support transgender scientists’ 
careers, and broadening access to inclusive 
health care. Determining the most effective 
approaches to reducing barriers for trans- 
gender scientists will require more studies 
highlighting their experiences. Besides the 
inherent moral value of making academia 
more equitable, scientific endeavors will 
benefit tremendously from the intellectual 
potential of a greater diversity of people. 
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Adapt taxonomy to 
conservation goals 


As the sixth mass extinction of species con- 
tinues to unfold (J), cataloguing uncharted 
biodiversity to ensure efficient conservation 
efforts becomes ever more important (2). 
Taxonomists and biodiversity scientists are 
addressing this challenge with the help of 
technological leaps forward in molecular 
biology and digital archiving (3). However, 
increasing restrictions on the permits 
required to collect specimens complicate 
their efforts (4). The difficulties inherent 

in collecting a required type specimen 

(an individual that serves as a reference) 
can delay species identification, and vital 
conservation action often cannot take place 
until the species is protected by name (5, 6). 
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Formalization of criteria to describe a species 
without a type specimen would expedite nec- 
essary conservation efforts and enforcement. 

The problems collecting type specimens 
are particularly pertinent to taxa facing 
imminent conservation threats, such as the 
birds of Southeast Asia, which are threat- 
ened by a vigorous pet trade (7). Difficulties 
in procuring type specimens have left 
observed but unidentified species without 
formal recognition for years (8—10). This 
gap between initial recognition and formal 
description may even heighten conser- 
vation pressure, as species can become 
targets of traders yet receive no legislative 
protection (10). Without a formal descrip- 
tion, a species cannot typically be officially 
considered at risk; thus, their trade is not 
necessarily illegal, regardless of the extinc- 
tion risk they face (6). 

While it is technically possible to describe 
a species without a type specimen (4), it 
is rare and not widely accepted by expert 
bodies. Taxon experts could help by provid- 
ing standardized guidelines designating the 
acceptable alternative evidence, potentially 
including DNA barcodes, photographs, 
recordings, and measurements. Making such 
allowances would facilitate streamlined spe- 
cies description where conservation peril is 
real, while maintaining high taxonomic stan- 
dards (3, 11). Specimen collection remains an 
essential tool (72) and should always be the 
“gold standard,” but it must not serve as an 
obstacle to urgent conservation efforts. 
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IN FLUX AND 


—— UNDER THREAT 


By Tage Rai and Brad Wible 


round the world, democracy is losing 
ground. Polarization and disinforma- 
tion have rendered liberals and conser- 
vatives unable to agree on basic facts. 
State violence and suppression of citi- 
zens’ rights are resurgent. Free and fair 
elections are being threatened. 

In this special issue, we critically ex- 
amine the state of democracy and how it 
must adapt to achieve its ideals in the 21st century. 
We need to meet the challenges and opportunities 
of living in increasingly multiethnic societies, of 
fostering democracy in a weakened international 
environment, of reducing inequality and elevat- 
ing the political representation of the poor, and 
of organizing social movements and combating 
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disinformation tactics in the digital age. Advances 
in technology are making it easier to distort true 
voter representation through gerrymandering, 
and political campaigns continue to struggle with 
reaching voters and persuading them to partici- 
pate. Worryingly, state violence, which has always 
been a core feature of the democratic experience 
for some, is spreading in democratic societies. 
Twenty years ago, it seemed inevitable that 
democracy would reach every corner of the globe. 
In this moment, we are reminded that we must 
fight for democracy and work to improve it. 
A scientific understanding of the social and 
behavioral phenomena that underlie its operation 
will help us enhance democracy and, by doing so, 
improve human lives and societies globally. 
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Protests against mistreatment 
of Black people by police 
highlight that many U.S. citizens 
effectively live outside the 
provisions of the Constitution. 


Racial authoritarianism in U.S. democracy 


One segment of the population experiences different rules and differential citizenship. 


By Vesla M. Weaver! and Gwen Prowse? 


ecently, casual and savage violence 

of police against peaceful protest- 

ers and images of police in military 

gear sweeping up residents into un- 

marked vans has led journalists to 

question whether U.S. democracy 
is in peril. Many observers described these 
recent actions as authoritarian. But racial 
authoritarianism has been central to citizen- 
ship and governance of race-class subjugated 
communities throughout the 20th and early 
21st centuries. It describes state oppression 
such that groups of residents live under ex- 
tremely divergent experiences of government 
and laws. Yet when police engage in excessive 
surveillance, incursions on civil liberties, and 
arbitrary force as a matter of routine patrol, 
many scholars of American politics are reluc- 
tant to consider it a violation of democracy 
and instead deem them aberrations in an 
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otherwise functioning democracy. This mis- 
characterization is not limited only to intel- 
lectual discourse but also affects the public 
sphere. By obscuring evidence of racial au- 
thoritarianism, reforms will not land where 
needed. Procedural reform is useful when we 
are simply improving policing, not ridding 
democracy of authoritarian practices. 

Racial authoritarian governance has 
deeply shaped our institutions, political ar- 
rangements, and state development, and 
virtually every racial justice movement over 
the past 100 years has tried to expose its 
operation, challenge it, and seek freedom 
from it (1). Coterminous with democracy in 
the United States, racially authoritarian pat- 
terns are reproduced and innovated after pe- 
riods of democratic expansion in the United 
States. Since the 1960s, policing has been 
the primary administrative tool of racial au- 
thoritarianism: One segment of the popula- 
tion effectively lives under a different set of 
rules and, as a result, experiences differential 
power and citizenship. 

Although many Black intellectuals and 
citizens have understood how authoritarian 
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power operates on citizens within a democ- 
racy, scholars of U.S. politics largely overlook 
state power to coerce, surveil, and enact vio- 
lence often by police authorities and treat it 
as unimportant to theorizing our democracy. 
Starting from the assumption of a liberal 
tradition and examining deviations from 
a mostly pluralistic polity, they document 
evidence of democratic retreat only when 
political competition is curtailed and trust in 
governing institutions erodes, despite over- 
whelming evidence of racial authoritarian- 
ism. This view, stretching from the field’s de- 
fining scholars to the present day, is housed 
within a polity that was increasingly turning 
to, and expanding, its coercive instruments of 
surveillance, predation, violent intimidation, 
and confinement, concentrated on race-class 
subjugated residents. 

The result is a substantive and substantial 
narrowing: By failing to consider the possi- 
bility of widespread, coherent, and racially 
targeted authoritarian practices, the focus 
in academic debates becomes improving as- 
pects of democratic quality and the distribu- 
tion and delivery of democratic goods—more 
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representation, more votes, more responsive 
policy—while rendering invisible the lack of 
autonomy and freedom, and the vulnerabil- 
ity to state violence and illegal takings, that 
characterizes the experience of U.S. democ- 
racy for those experiencing its more authori- 
tarian aspects. We should augment our un- 
derstanding, theories, and measurement to 
encompass or reconcile the presence of such 
authoritarian practices within U.S. democ- 
racy. In addition to measuring democratic 
performance through national indicators 
such as free and fair elections, we should also 
include local coercive practices concentrated 
on subgroups of the population. 


A TRENCHANT REBUTTAL 

Once we look beyond democracy’s formal 
structures, institutions, and rules to the 
lived experiences of political authority, we 
see that they pose a sharp contrast, and a 
trenchant rebuttal, to the conventional un- 
derstandings of liberal democracy. For exam- 
ple, drawing on the largest database of narra- 
tive accounts of policing in U.S. cities after the 
Baltimore uprising of 2015, we see that U.S. 
residents have a sophisticated understanding 
of the actual operation of democracy and are 
witnesses to its relationship to authoritar- 
ian practices (2). Stopped by police, subject 
to violation of privacy and displays of force, 
routine seizure of resources, and unable to 
freely assemble because of police occupation 
of their neighborhoods, they described being 
effectively outside the provisions of the main 
text of U.S. democracy—the Constitution: 

“But every black and every Hispanic that 
gets stopped, especially here in LA, they 
asked to get out their car...okay. And it’s a 
difference. When you're telling me, you’re go- 
ing to go and say, ‘Oh you're just nitpicking, 
youre crying, youre complaining’ But we live 
this. You see? We live it” [(2), p. 1162]. 

“Theyre paid to protect and serve but 
they’re not protecting us, they’re not serving 
us, they’re killing us and eliminating us” [(2), 
p. 1160]. 

Police have long proscribed the movement 
of Black communities and engaged in racial 
and social control. When historians inter- 
viewed several thousand Black Americans 
who had lived under Jim Crow (state and lo- 
cal laws that enforced racial segregation and 
disenfranchisement in the U.S. South) in the 
1930s and 1940s, police were understood to 
be guardians of white democracy (3). They 
described orientations similar to conversa- 
tions about life decades later. For example, 
how police goaded Black people into displays 
of force: “They would come and mess with 
you in order for you to say something ... This 
gave them an excuse to hit you, you know” 
(3). State violence through police was wit- 
nessed, as well as the absence of accountabil- 
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ity when police executed Black people. 

When we look to narrative accounts and 
the undemocratic practices they reveal, we 
may be better equipped to anticipate criti- 
cal ruptures in political life. State practices 
of policing, surveillance, and impunity that 
are quotidian for racially subjugated people, 
when popularized, become worrying signs of 
an authoritarian turn. 


HIDING IN PLAIN VIEW 

Despite racial authoritarianism’s glaring 
presence in experiential accounts of U.S. de- 
mocracy, it has been hiding in plain view in 
the field of political science. In a field respon- 
sible for constructing metrics on democratic 
stability and political behavior, our failure 
to theorize racial authoritarianism has had 
consequences for how U.S. democracy is con- 
ceived by the public and policy-makers. 

There are several reasons why racial au- 
thoritarianism in the United States has, for 
so long, gone unnamed by our field. One 
reason is because scholars tend to discount 
knowledge derived from a bottom-up ap- 
proach (actual citizen experience), which 
may obscure our understanding of how gov- 
ernment authority is actually experienced. 
Empirical research on democracy leans heav- 
ily on quantifiable indices (such as the Polity 
Index) and nationally representative survey 
samples. These measures are useful tools for 
comparative analysis and standardized snap- 
shots for change over time, but they do not 
leave room for citizens to define democratic 
deficits on their own terms or through their 
own experiential accounts. When we use nar- 
rative accounts as the lens through which we 
view U.S. democracy, racial authoritarianism 
comes clearly into focus. 

Relatedly, scholars tend to fixate on nation- 
ally representative institutions and political 
activities such as voting and operate from 
an overly narrow definition of authoritarian 
practices (executive power grabs, direct po- 
lice collusion, and limited political competi- 
tion). But the focus on executive overreach 
can be misleading in a political system as de- 
centralized as the United States, where local 
governments have high levels of autonomy 
over police authority in particular. Without a 
focus on the local or subnational level, it is 
easy to overlook the ways in which U.S. feder- 
alism facilitates racial authoritarianism. 

Third, for scholars who have written about 
modern policing practices, there is no short- 
age of analysis of their racially disparate 
outcomes (J). But students of political sci- 
ence have tended to examine the coercion, 
occupation, subjectivity, and extraction that 
constitute what we call racial authoritarian- 
ism in isolation from democracy. We tend to 
analyze racialized policing within a separate 
literature on incarceration and criminal jus- 
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tice; but why should we not also analyze it in 
the literature on democratic transitions, sub- 
national or group-based authoritarianism, 
and political violence? 

If the field of political science sequestered 
police repression from questions of democ- 
racy, historical Black thinkers did not. An 
understanding of racial authoritarianism— 
although completely absent in mainstream 
scholarship—animated historical Black 
theorizations that contested U.S. democ- 
racy’s hard line boundary from authoritar- 
ian modes of governance. They saw police 
violence and power as a central instrument 
upholding the differentiated citizenship 
key to the operation of democracy in the 
United States. For example, in 1966 James 
Baldwin wrote, “I have witnessed and en- 
dured the brutality of the police many more 
times than once—but, of course, I can- 
not prove it. I cannot prove it because the 
Police Department investigates itself, quite 
as though it were answerable only to itself. 
But it cannot be allowed to be answerable 
only to itself. It must be made to answer to 
the community which pays it, and which it 
is legally sworn to protect, and if American 
Negroes are not a part of the American 
community, then all of the American pro- 
fessions are a fraud” (4). 

This brings us to the final reason, which 
is that we have been working from founda- 
tions of a discipline that has segregated and 
isolated Black knowledge. For example, our 
field’s most vaunted scholar of American 
democracy, Robert Dahl, theorized civic life 
through a case study in New Haven dur- 
ing a period of mass racial upheaval across 
northern U.S. cities (5). Yet, Dahl’s account 
portrayed a democracy that subjugated Black 
citizens did not live and had never taken part 
in. Political science scholars have typically ex- 
amined democratic deficits as a question of 
who is represented and how; they tend to fo- 
cus on exclusion from political participation 
or social citizenship, or hindrances on the 
ability of citizens to have equal influence (6). 

Scholarly treatises flowed from Dahl’s con- 
ceptions but stood uneasily alongside a cho- 
rus of Black intellectuals, folk leaders, and 
activists that contested the clean distinction 
between democracy and authoritarian rule. 
Instead of describing pluralism, polyarchy, 
and liberalism, they called attention to un- 
democratic legacies, visible and unapologeti- 
cally practiced on their streets. 

That mainstream approaches have hard- 
ened into deep scholarly grooves has had 
consequences. Today, students learn about 
authoritarianism abroad. We are taught 
American exceptionalism, the idea that the 
United States is singular for its old consti- 
tution, institutional arrangements such as 
federalism, lack of feudalism, and weaker 
1177 
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welfare state, not because we have a racial 
authoritarianism distinct from any other 
nation in the western world. When we do 
recognize authoritarian governance in the 
United States, it is a past relic, confined to 
the post-emancipation U.S. South where 
Black disenfranchisement, one-party rule, 
and explicit political violence reigned but 
was eventually overcome. And when schol- 
ars present evidence of democratic back- 
sliding in contemporary U.S. politics, they 
ignore the expansion of racial repression, 
focusing instead on polarization, distrust in 
institutions, and extreme income inequal- 
ity—all of which themselves derive from or 
are linked to racial authoritarianism. 


PROMISING FRAMEWORKS 

How can scholars study authoritarian modes 
of governance within democratic states? 
What can attending to racial authoritarian- 
ism teach us about the nature and evolution 
of U.S. democracy? Fortunately, there are 
promising theories on which we can build. 
A few scholars have pointed to the pos- 
sibility that authoritarian practices co- 
exist within formally democratic states 
and institutions. King and Smith have 
argued that U.S. democracy was formed 
through the contestation of liberal egali- 
tarian ideologies and illiberal, ascriptive 
hierarchy (7). Miller describes “racial- 
ized state failure” in which U.S. federal- 
ism and racism interact to create condi- 
tions comparable with those of failed states 
(such as extreme levels of homicide, state 
violence, and imprisonment) (8). Hanchard 
reminds us that the most celebrated de- 
mocracies, back to ancient Athens, had the 
longest histories of racial slavery, subjuga- 
tion, imperialism, police terror, and highly 
unequal labor regimes (9). He argues against 
typical stances in our field that tend to ignore 
the coexistence of democracy and ethnoracial 
hierarchy and that the former’s institutional 
development was shaped by the latter: “The 
seemingly straightforward genealogy that 
reduces democracy to its formal and perfor- 
mative elements ignores how coercion, em- 
pire, and forced labor have been deeply in- 
tertwined in democratic experiments in the 
Greek city-states and in contemporary societ- 
ies” [(10), p. 68]. 

The literature on the relatively recent de- 
mocratization of the United States also offers 
an opening. Scholars of U.S. and comparative 
political development have long understood 
one-party rule in the South before the Second 
Reconstruction (1945-1968) as authoritarian. 
Mickey has analyzed these “authoritarian en- 
claves,’ that “created and regulated racially 
separate—and significantly unfree—civic 
spheres” [(10), p. 5]. Gibson has described 
subnational authoritarianism in the United 
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States as compatible with, and enabled by, 
federal democratic institutions before the 
Second Reconstruction (17). However, schol- 
ars stop shy of theorizing the persistence or 
reemergence of authoritarian practices after 
the fall of territorial subnational authoritari- 
anism in the 1960s. 

Last, we can learn from scholars working 
outside the United States who have analyzed 
and provided theories to explain conditions 
that aid the endurance of coercive institu- 
tions in democracies, including the police, 
who further “stratify citizenship” along the 
dimensions of race, class, and geography by 
failing to protect citizens, serving instead the 
interests of the state and engaging in extra- 
legal force (72). In countries with histories 
of military rule, norms of police violence en- 
dure in the transition to democracy. During 
dictatorships, even the middle classes are 
subject to state and police repression, but 
this falls away under democratic reforms; 
ironically, the rise of democracy helps con- 
centrate police violence on poor and raced 


“.»dronically, the rise of democracy 


helps concentrate police violence 
on poor and raced groups.” 


groups. Citizens being “outlaws” in Bolivia— 
unprotected by police and law but also 
subject to its capricious regulation—draws 
parallel to Black communities in the United 
States experiencing “legal estrangement” 
(13, 14). How might scholars better connect 
racial authoritarianism across democracies? 

Unlike Latin American cases, where au- 
thoritarian practices predated and then sur- 
vived democratic openings, in the United 
States, authoritarian policing tended to 
develop after democratic expansions. State 
power to surveil and confine citizens in- 
creased in response to a wave of democratiza- 
tion in both the First (1863-1877) and Second 
Reconstruction. On the heels of the abolition 
of slavery, new forms of repression evolved, 
including the leasing of Black convict labor; 
after the voting, civil rights, and fair hous- 
ing acts of the 1960s, racially targeted polic- 
ing practices grew on nearly every indicator 
(2). Scholars should account for whether and 
why police power and Black mass imprison- 
ment have tended to grow in relation to peri- 
ods of formal democratization. 


ANEMIC, DISTORTED, DIRE 

The United States is now and has histori- 
cally been characterized by high levels of 
state control of and violence toward ra- 
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cially subjugated groups alongside formal 
political freedom. Just as racial slavery 
defined U.S. democracy historically, racial 
authoritarianism continues to define the 
practices of our democracy. In the current 
political moment, recognition of the fray- 
ing of democratic institutions has collided 
with a movement for Black liberation from 
police atrocities. Scholars often do the 
work of making such a connection legible 
more broadly. But if scholars continue to 
keep the former separate from the latter 
by ignoring racial authoritarianism, we 
will continue to have an anemic and dis- 
torted conception of U.S. democracy, with 
potentially dire consequences for policy. It 
is perhaps unsurprising that the media has 
followed suit, presenting racialized polic- 
ing as distinct from democratic backslid- 
ing, linked only by the executive’s rhetoric 
and actions. 

Political scientists prepare and educate the 
next generation of civic leaders, teachers, pol- 
icy-makers, pollsters, and change agents; by 
representing to them democracy in this 
way, we give them a half-truth, a flawed 
understanding of U.S. democracy, which 
may shrink policy agendas and political 
discourse more broadly. The analysis 
and description of democratic frame- 
works—and, for example, backsliding— 
influences the media and carries weight 
in policy circles (15). Thus, it is essential 
that political scientists continue to offer 
theories for understanding democracy with 
attention to its actual practice in heavily po- 
liced communities, so as not to squander an 
opportunity to improve it. @ 
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Human-centered redistricting 
automation in the age of Al 


Human-machine collaboration and transparency are key 


By Wendy K. Tam Cho!2%45 and Bruce E. Cain®’ 


edistricting—the constitutionally man- 

dated, decennial redrawing of elec- 

toral district boundaries—can distort 

representative democracy. An adept 

map drawer can elicit a wide range of 

election outcomes just by regrouping 
voters (see the figure). When there are thou- 
sands of precincts, the number of possible 
partitions is astronomical, giving rise to enor- 
mous potential manipulation. Recent techno- 
logical advances have enabled new computa- 
tional redistricting algorithms, deployable 
on supercomputers, that can explore tril- 
lions of possible electoral maps without hu- 
man intervention. This leaves us to wonder 
if Supreme Court Justice Elena Kagan was 
prescient when she lamented, “(t)he 2010 re- 
districting cycle produced some of the worst 
partisan gerrymanders on record. The tech- 
nology will only get better, so the 2020 cycle 
will only get worse” (Gill v. Whitford). Given 
the irresistible urge of biased politicians to 
use computers to draw gerrymanders and 
the capability of computers to autonomously 
produce maps, perhaps we should just 
let the machines take over. The North 
Carolina Senate recently moved in this 
direction when it used a state lottery 
machine to choose from among 1000 
computer-drawn maps. However, im- 
proving the process and, more impor- 
tantly, the outcomes results not from 
developing technology but from our 
ability to understand its potential and 
to manage its (mis)use. 

It has taken many years to develop 
the computing hardware, derive the 
theoretical basis, and implement 
the algorithms that automate map 


A 50-50 party split can be turn 
When redistricting a state with 
political manipulation is enormous. 


The most important decisions in devising an 
electoral map are grounded in philosophi- 
cal or political judgments about which the 
technology is irrelevant. It is nonsensical to 
completely transform a debate over philo- 
sophical values into a mathematical exercise. 

As technology advances, computers are 
able to digest progressively larger quanti- 
ties of data per time unit. Yet more com- 
putation is not equivalent to more fairness. 
More computation fuels an increased ca- 
pacity for identifying patterns within data. 
But more computation has no relationship 
with the moral and ethical standards of an 
evolving and developing society. Neither 
computation nor even an equitable process 
guarantees a fair outcome. 

The way forward is for people to work 
collaboratively with machines to produce 
results not otherwise possible. To do this, 
we must capitalize on the strengths and 
minimize the weaknesses of both artificial 
intelligence (AI) and human intelligence. 
Ensuring representational fairness requires 
metacognition that integrates creative and 
benevolent compromises. Humans have the 


Time to regroup 
Markedly different outcomes can emerge when six Republicans and six 
Democrats in these 12 geographic units are grouped into four districts. 
ed into a 3:1 advantage for either party. 

thousands of precincts, the potential for 


@ Republican (R) district © Democrat (D) district 
12 voters (6 R, 6 D) 


4 districts: 1R, 3D 


creation (both generating enormous 
numbers of maps and uniformly sam- 
pling them) (/-4). Yet these innova- 
tions have been “easy” compared with 
the very difficult problem of ensur- 
ing fair political representation for 
a richly diverse society. Redistricting 
is a complex sociopolitical issue for 
which the role of science and the 
advances in computing are nonobvi- 
ous. Accordingly, we must not allow a 
fascination with technological meth- 
ods to obscure a fundamental truth: 
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advantage over machines in metacognition. 
Machines have the advantage in produc- 
ing large numbers of rote computations. 
Although machines produce information, 
humans must infuse values to make judg- 
ments about how this information should 
be used (5). 

Accordingly, machines can be tasked 
with the menial aspects of cognition—the 
meticulous exploration of the astronomi- 
cal number of ways in which a state can be 
partitioned. This helps us classify and un- 
derstand the range of possibilities and the 
interplay of competing interests. Machines 
enhance and inform intelligent decision- 
making by helping us navigate the unfath- 
omably large and complex informational 
landscape. Left to their own devices, hu- 
mans have shown themselves to be unable 
to resist the temptation to chart biased 
paths through that terrain. 


HOW MIGHT COLLABORATION HAPPEN? 
The ideal redistricting process begins with 
humans articulating the initial criteria for 
the construction of a fair electoral map (e.g., 
population equality, compactness measures, 
constraints on breaking political subdivi- 
sions, and representation thresholds). Here, 
the concerns of many different communities 
of interest should be solicited and consid- 
ered. Note that this starting point already re- 
quires critical human interaction and consid- 
erable deliberation. Determining what data 
to use, and how, is not automatable (e.g., citi- 
zen voting age versus voting age population, 
relevant past elections, and how to 
forecast future vote choices). Partisan 
measures (e.g., mean-median dif- 
ference, competitiveness, likely seat 
outcome, and efficiency gap) as well 
as vote prediction models, which are 
often contentious in court, should be 
transparently specified. 

Once we have settled on the inputs 
to the algorithm, the computational 
analysis produces a large sample of 
redistricting plans that satisfy these 
principles. Trade-offs usually arise 
(e.g., adhering to compactness rules 
might require splitting jagged cit- 
ies). Humans must make value-laden 
judgments about these trade-offs, of- 
ten through contentious debate. 

The process would then iterate. 
After some contemplation, we may 
decide, perhaps, on two, not three, 
majority-minority districts so that 
a particular town is kept together. 
These refined goals could then be 
specified for another computational 
analysis round with further delib- 
eration to follow. Sometimes a Pareto 
improvement principle applies, with 
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the algorithm assigned to ascertain whether, 
for example, city splits or minority represen- 
tation can be maintained or improved even 
as one raises the overall level of compliance 
with other factors such as compactness. In 
such a process, computers assist by clarifying 
the feasibility of various trade-offs, but they 
do not supplant the human value judgments 
that are necessary for adjusting these plans 
to make them “humanly rational.” Neglecting 
the essential human role is to substitute ma- 
chine irrationality for human bias. 

Automation in redistricting is not a sub- 
stitute for human intelligence and effort; its 
role is to augment human capabilities by reg- 
ulating nefarious intent with increased trans- 
parency, and by bolstering productivity by ef- 
ficiently parsing and synthesizing data 
to improve the informational basis for 
human decision-making. Redistricting 
automation does not replace human 
labor; it improves it. The critical goal 
for AI in governance is to design suc- 
cessful processes for human-machine 
collaboration. This process must in- 
hibit the ill effects from sole reliance 
on humans as well as overreliance on 
machines. Human-machine collaboration is 
key, and transparency is essential. 


IRCS AND TRANSPARENCY 
The most promising institutional route in the 
near term for adopting this human-machine 
line-drawing process is through indepen- 
dent redistricting commissions (IRCs) that 
replace politicians with a balanced set of 
partisan citizen commissioners. IRCs are a 
relatively new concept and exist in only some 
states. They have varied designs. In eight 
states, a commission has primary responsi- 
bility for drawing the congressional plan. In 
six, they are only advisory to the legislature. 
In two states, they have no role unless the 
legislature fails to enact a plan. IRCs also 
vary in the number of commissioners, parti- 
san affiliation, how the pool of applicants is 
created, and who selects the final members. 
The lack of a blueprint for an IRC allows 
each to set its own rules, paving the way for 
new approaches. Although no best practices 
have yet emerged for these new institutions, 
we can glean some lessons from past efforts 
about how to integrate technology into a 
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partisan balanced deliberation process. For 
example, Mexico’s process integrated algo- 
rithms but struggled with transparency, and 
the North Carolina Senate relied heavily on 
a randomness component. Both offer lessons 
and help us refine our understanding of how 
to keep bias from creeping into the process. 
Once these structural decisions are made, 
we must still contend with the fact that de- 
vising electoral maps is an intricate process, 
and IRCs generally lack the expertise that 
politicians and their staffs have cultivated 
from decades of experience. In addition, as 
the bitter partisanship of the 2011 Arizona 
citizen commission demonstrated, without 
a method to assess the fairness of propos- 
als, IRCs can easily deadlock or devolve into 


“We must ensure that the science 


behind redistricting does not, itself, 
become partisanship’s latest victim.” 


lengthy litigation battles (6). New techno- 
logical tools can aid IRCs in fulfilling their 
mandate by compensating for this experi- 
ence deficiency as well as providing a way to 
benchmark fairness conceptualizations. 

To maintain public confidence in their pro- 
cesses, IRCs would need to specify the criteria 
that guide the computational algorithm and 
implement the iterative process in a transpar- 
ent manner. Open deliberation is crucial. For 
instance, once the range of maps is known 
to produce, say, a seven-to-eight likely split 
in Democrat-to-Republican seats 35% of the 
time, an eight-to-seven likely Democrat-to- 
Republican split 40% of the time, and some- 
thing outside these two choices 25% of the 
time, how does an IRC choose between these 
partisan splits? Do they favor a split that pro- 
duces more compact districts? How do they 
weigh the interests of racial minorities versus 
partisan considerations? 


LEVELING THE PLAYING FIELD 
Regardless of what technology may be de- 
veloped, in many states, the majority party 
of the state legislature assumes the primary 
role in creating a redistricting plan—and 
with rare exceptions, enjoys wide latitude in 
constructing district lines. There is neither a 
requirement nor an incentive for these self- 
interested actors to consent to a new process 
or to relinquish any of their constitutionally 
granted control over redistricting. 

All the same, technological innovation 
can still have benefits by ameliorating infor- 
mational imbalance. Consider redistricting 


Ohio’s 16 congressional seats. A computa- 
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tional analysis might reveal that, given some 
set of prearranged criteria (e.g., equal popu- 
lation across districts, compact shapes, a mi- 
nority district, and keeping particular com- 
munities of interest together), the number of 
Republican congressional seats usually ends 
up being 9 out of 16, and almost never more 
than 11. Although the politicians could still 
then introduce a map with 12 Republican 
seats, they would now have to weigh the po- 
tential public backlash from presenting elec- 
toral districts that are believed, a priori, to be 
overtly and excessively partisan. In this way, 
the information that is made more broadly 
known through technological innovation 
induces a new pressure point on the system 
whereby reform might occur. 

Although politicians might not 
welcome the changes that technology 
brings, they cannot prevent the ush- 
ering in of a new informational era. 
States are constitutionally granted the 
right to enact maps as they wish, but 
their processes in the emerging digital 
age are more easily monitored and as- 
sessed. Whereas before, politicians ex- 
ploited an information advantage, sci- 
entific advances can decrease this disparity 
and subject the process to increased scrutiny. 


PERVERSION VERSUS PROMISE OF SCIENCE 
Although science has the potential to 
loosen the grip that partisanship has held 
over the redistricting process, we must en- 
sure that the science behind redistricting 
does not, itself, become partisanship’s lat- 
est victim. Scientific research is never easy, 
but it is especially vulnerable in redistrict- 
ing where the technical details are intri- 
cate and the outcomes are overtly political. 

We must be wary of consecrating research 
aimed at promoting a particular outcome 
or believing that a scientist’s credentials ab- 
solve partisan tendencies. In redistricting, it 
may seem obvious to some that the majority 
party has abused its power, but validating 
research that supports that conclusion be- 
cause of a bias toward such a preconceived 
outcome would not improve societal gover- 
nance. Instead, use of faulty scientific tests as 
a basis for invalidating electoral maps allows 
bad actors to later overturn good maps with 
the same faulty tests, ultimately destroying 
our ability to legally distinguish good from 
bad. Validating maps using partisan prefer- 
ences under the guise of science is more dan- 
gerous than partisanship itself. 

The courts must also contend with the 
inconvenient fact that although their judg- 
ments may rely on scientific research, sci- 
entific progress is necessarily and excruci- 
atingly slow. This highlights a fundamental 
incompatibility between the precedential 
nature of the law and the unrelenting 
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need for high-quality science to take time 
to ponder, digest, and deliberate. Because 
of the precedential nature of legal deci- 
sion-making, enshrining underdeveloped 
ideas has harmful path-dependent effects. 
Hence, peer review by the relevant scien- 
tific community, although far from perfect, 
is clearly necessary. For redistricting, tech- 
nical scientific communities as well as the 
social scientific and legal communities are 
all relevant and central, with none taking 
over the role of another. 

The relationship of technology with the 
goals of democracy must not be underap- 
preciated—or overappreciated. Technological 
progress can never be stopped, but we must 
carefully manage its impact so that it leads 
to improved societal outcomes. The indis- 
pensable ingredient for success will be how 
humans design and oversee the processes we 
use for managing technological innovation. 
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PERSPECTIVE 


Campaigns influence election 
outcomes less than you think 


Campaigns have small effects but are built to win close races 


By David W. Nickerson! and Todd Rogers? 


.S. presidential campaigns spend 

hundreds of millions of dollars each 

election cycle to maximize their 

chance of electoral victory. Media 

coverage analyzes individual cam- 

paign advertisements, activities, and 
decisions as if they are hugely influential. 
Yet, whether an election is close or not is 
due to factors that are outside the control of 
electoral campaigns, such as wars and pan- 
demics or even candidate characteristics. 
In fact, roughly two-thirds of the variance 
in U.S. presidential election outcomes— 
where both sides always run substantial 
campaigns and frame these fundamentals 
for voters—can be explained by simple 
models using just economic performance 
and whether the incumbent is running (1). 
Several strands of academic literature may 
support a perception that some small cam- 
paign decisions can make big differences 
in voter attitudes and behaviors [e.g., how 
arguments are framed (2) or where field 
offices are placed in battleground states 
(3)]. This work likely overstates the effect 
of campaigns in the field, though, because 
it isolates specific elements from the cha- 
otic din of real-world politics and therefore 
either cannot control for the endogenous 
strategic decisions campaigns make or 
does not occur in environments when vot- 
ers’ partisan identities are fully activated. 
By pulling together disparate strands of 
research and situating presidential cam- 
paigns in their broader electoral, social, 
and media contexts, we argue that sizable 
persuasive effects from campaign activities 
seem very unlikely to be observed in real- 
world elections (4). 

Partisanship is the most important deter- 
minant of vote choice and is an extremely 
stable trait. Strong partisans (roughly 40% 
of the population) are deeply committed 
to their political beliefs and preferences, 
which makes them extraordinarily nonre- 
sponsive to electoral persuasion from the 
other side but excellent candidates for mo- 
bilization. But even when targeting people 
with weaker partisan attachments (~50%), 
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campaign communications have difficulty 
overcoming the psychology of partisanship. 
First, people prefer to consume messages 
consistent with their partisan identities, 
which makes contact difficult, even through 
paid advertising (5), a finding that holds 
true even in online outlets (6). 

Second, even when campaigns reach 
their intended persuasion targets, partisan- 
motivated reasoning counteracts accep- 
tance of the appeals. Affective polarization 
(i.e., the difference in how warmly people 
feel toward their own party and the oppos- 
ing party) and negative partisanship (i.e., 
the extent to which people dislike the op- 
position) lead partisans to automatically 
dislike, distrust, and resist communications 
from members of the opposing party (7), to 
the point of dehumanizing the opposition 
(8). This leads partisans to reject counter- 
partisan messages, even when these mes- 
sages align with their political values (9). 

Finally, the roughly 10% of the popula- 
tion that lack attachment to a party—and 
the polarizing cognitive processes that 
come with such attachment—should make 
nonpartisans ideal targets for persuasion. 
However, these “true independents” are 
relatively less interested in politics and ac- 
tively avoid political content in daily life. 
Thus, they are rarely exposed to campaign 
messages and often respond negatively 
to partisan outreach, not because of ideo- 
logical reasons but because they tend to 
find politics generally objectionable (J0). 
Whether these individuals are nonpartisan 
because they dislike politics or vice versa is 
an open question that can be addressed as 
long-term political panel surveys get more 
numerous and run longer. 

Campaigns segment the electorate into 
groups to target for different purposes: 
convincing strong supporters to volunteer 
and donate; mobilizing less engaged sup- 
porters to vote; persuading nonsupport- 
ers. But the crowded communication en- 
vironment moderates the effects of these 
efforts. Countermessaging by opponents 
can eliminate initial persuasive effects of 
political messaging and reduces a message’s 
persuasive effects by casting doubt on the 
veracity of basic facts (17). Over the course 
of an election cycle, affective partisan po- 
larization increases by 50 to 150% (12); this 
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conflicting and constant partisan commu- 
nication may be part of the cause. 

Even messaging from allied groups can 
diminish the marginal effect of campaign 
communications. Organizations doing voter 
outreach all draw on the same databases and 
use similar techniques to model the elector- 
ate, so allied campaigns tend to talk to the 
same people (13). The net result of this mi- 
crotargeting is that an individual voter may 
receive dozens of contacts from multiple 
groups all advocating for similar positions, 
which diminishes the marginal effect of ev- 
ery single contact, potentially completely. 

The very ubiquity of campaign communi- 
cation makes studying the aggregate effects 
of campaign efforts very difficult. Field ex- 
periments may be able to tell us the marginal 
effects of specific campaign communication 
tactics but cannot estimate the presence or 
absence of a campaign as whole or big pic- 
ture messaging decisions. Nearly blanket 
coverage of targeted voters and strategic 
targeting by campaigns ensure that nontar- 
geted voters are omitted purposefully and do 
not serve as a good baseline for estimating 
the effect of campaign targeting. 

Whereas campaigns communicate their 
messages directly to voters through many 
channels (e.g., mail, phone calls, TV adver- 
tisements, online advertisements, and social 
media accounts), most voters experience 
elections through media that campaigns 
do not control (e.g., written news sources, 
TV programs, social media, etc.). Although 
experiments that allow subjects to select 
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their exposure show that political program- 
ming has no measurable effect on viewers 
because people select shows that they enjoy 
and do not expose themselves to uncom- 
fortable material, a recent experiment in 
which participants were randomly assigned 
to watch partisan media and later randomly 
assigned to discussion groups showed that 
exposure to partisan media polarized the 
opinion of discussion partners who were 
not exposed to the media (1/4). There are 
certainly patterns to what social media nar- 
ratives are picked up by mainstream media 
outlets, but what messages are ultimately 
amplified by both social media and main- 
stream media is outside the campaign’s 
direct control. How campaigns most effec- 
tively shape “the media narrative” is an area 
ripe for future research. 

Campaigns for offices further “down the 
ballot” (and in other settings) are likely to 
see larger effects than seen in U.S. presiden- 
tial campaigns because the offices are less 
polarized, the media give these races less at- 
tention, voters have less information about 
the candidates and fewer cues, and there 
are fewer outside groups devoting resources 
to the election, so communication channels 
are less crowded. The fact that U.S. presi- 
dential campaigns spend so much money 
and effort on activities with objectively 
small direct effects is a testament to the in- 
credible value of wielding political power. 
The U.S. federal government has an annual 
budget of over $4 trillion and regulates 
nearly every facet of economic and social 
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life in some manner. Given this fact, hun- 
dreds of millions of dollars, or even billions, 
spent to influence who controls the execu- 
tive branch in the event that an election is 
close may seem justifiable to both donors 
and candidates, especially when the parties 
hold very different policy preferences. ® 
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Diversity and prosocial behavior 


Delia Baldassarri* and Maria Abascal 


Immigration and globalization have spurred interest in the effects of ethnic diversity in Western societies. 
Most scholars focus on whether diversity undermines trust, social capital, and collective goods provision. 
However, the type of prosociality that helps heterogeneous societies function is different from the in-group 
solidarity that glues homogeneous communities together. Social cohesion in multiethnic societies depends on 
whether prosocial behavior extends beyond close-knit networks and in-group boundaries. We identify two 
features of modern societies—social differentiation and economic interdependence—that can set the stage for 
constructive interactions with dissimilar others. Whether societal adaptations to diversity lead toward 
integration or division depends on the positions occupied by minorities and immigrants in the social structure 
and economic system, along with the institutional arrangements that determine their political inclusion. 


ost Western countries already are or 
are destined to become multiethnic 
societies thanks to recent patterns of 
migration and globalization. Growing 
immigration to North America and 
Western Europe (Fig. 1A) has commanded par- 
ticular attention. Increased ethnic heterogeneity 
has renewed scholarly interest in intergroup 
dynamics of cooperation and discrimination 
and spurred debates over the consequences of 
ethnic diversity for social trust and de- 
mocratic integration. Many scholars 
have concluded that ethnic diversity 
negatively affects overall levels of trust, 
social capital, and public goods provi- 
sion. Instead, we see these changes as 
an opportunity to ask a more important 
question: How does prosocial behavior 
extend beyond the boundaries of the in- 
group and to unknown and dissimilar others? 
Answering this question is the key to achieving 
solidarity and cooperation in the heterogeneous 
communities we increasingly inhabit today. 


Cooperation in heterogeneous versus 
homogeneous societies 


To function, large collectivities need to foster 
solidarity and cooperation among their mem- 
bers. Most theories of political order—from 
Enlightenment theories of the social contract 
(Hobbes and Rousseau) and Tocqueville’s 
Democracy in America to recent work on civil 
society and social capital—acknowledge the 
need for a sense of collective identity that 
allows trust and solidarity to extend beyond 
the boundaries of the family or clan to the 
larger community or nation. How does this 
come about? According to popular models of 
human behavior, repeated interactions within 
groups and close-knit networks facilitate the 
emergence of a shared culture, norms of rec- 
iprocity and cooperation, and peer sanctioning, 
inducing positive outcomes for the collectiv- 
ity (D). Homogeneous communities readily nur- 
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ture trust and solidarity through these avenues. 
In heterogeneous communities, by contrast, 
social ties between noncoethnics are sparser, 
which limits coordination and social control. In 
addition, social norms might not be shared across 
ethnic boundaries, or there might be uncertainty 
among members regarding the extent to which 
they are shared (2). Seen in this light, it makes 
sense to think of diversity as a challenge to the 
foundations of our collective social contract. 


“The key to solidarity and cooperation... 


is the extension of prosociality beyond 


” 


close-knit networks... 


Nevertheless, most heterogeneous communi- 
ties still manage to get along. As homogeneous 
communities become less prevalent and more 
people experience life in diverse contexts, we 
need to move beyond traditional understandings 
of prosociality. In order to achieve solidarity and 
cooperation, diverse communities may not rely 
on the same mechanisms as homogeneous ones. 
More than a century ago, in fact, Durkheim ar- 
gued that solidarity in complex, differentiated 
societies relies primarily on interdependence 
and the division of labor rather than on cultural 
similarity and mutual acquaintanceship (3). 
Following this lead, we identify two features 
of modern societies that have the potential to 
foster generalized prosociality. 

The first feature is social differentiation, 
which refers to the growing number of iden- 
tities and group affiliations that people have 
in their lives. As first theorized by Simmel, in 
modern societies individuals become less de- 
termined by a few ascribed categories—such as 
race, class, or gender—and experience a greater 
ability to choose their group affiliations. As 
people emancipate from family and commu- 
nity ties, out of choice or necessity, the number 
of unknown, distant others they will interact 
with increases, and this has been shown to 
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foster generalized prosociality (4, 5). A second, 
related feature is economic interdependence: 
Market-integrated societies in which strangers 
regularly engage in mutually beneficial tran- 
sactions exhibit greater levels of generalized 
solidarity and trust (6, 7). 

We should not take for granted that so- 
cieties will inevitably adapt to increasing di- 
versity in ways that further social integration. 
Critically important for social integration is 
the extent to which ethnic differences map 
onto class, religious, gender, or other differ- 
ences. Differentiation brings about social 
integration when lines of social division are 
cross-cutting—that is, when ethnic group mem- 
bership does not wholly predict member- 
ship in specific class, religious, gender, or other 
groups. By contrast, when social cleavages are 
consolidated, differentiation poses a threat 
to social integration (8) and democratic sta- 
bility (9). Ethnic diversity may thereby foster 
social division. 

Indeed, existing studies on the effects of 
ethnic diversity tend to highlight its negative 
consequences for social capital, economic growth, 
and public goods provision. We start by re- 
viewing this literature, which has dominated 
the debate regarding the consequences 
of ethnic diversity in Western socie- 
ties. However, to fully understand the 
conditions under which heterogeneous 
societies can achieve social cohesion 
across lines of ethnic differentia- 
tion, we also need to take stock of 
the status of immigrants and native 
minorities. Then, we discuss how dif- 
ferentiation and economic interdependence— 
two core features that emerge in modern 
societies—set the stage for a new kind of pro- 
sociality that extends beyond the confines of 
the in-group by enhancing the opportunities 
for intergroup contact, encouraging superordi- 
nate identification, and inhibiting in-group- 
out-group thinking. Overall, we argue that the 
type of prosociality that helps heterogeneous 
societies function likely derives from positive 
experiences in the context of strategic inter- 
actions, such as those in the workplace, and is 
different from the in-group solidarity that glues 
homogeneous communities together. 


The “problem” of diversity 


Political economy scholars have looked to ethnic 
diversity in their attempts to explain societal 
problems in developing countries, including 
violent conflicts and stalled economic growth 
(10). On the whole, however, studies paint a 
nuanced picture, one in which poverty and 
political instability, rather than ethnic or re- 
ligious divisions, increase the risk of civil war 
(1) and in which ethnic fractionalization is 
associated with lower growth only in the ab- 
sence of robust democratic institutions and 
policies (12, 13). 
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A second line of work, which focuses mainly 
on Western European and North American 
countries, instead probes within-country dif- 
ferences across homogeneous and heteroge- 
neous communities. These studies typically 
report negative associations between ethnic 
diversity and desirable outcomes, including civic 
engagement (14), public goods provision (15), 
and self-reported trust (76). On the association 
between diversity and trust alone, a recent 
review covers nearly 90 studies (17). Although 
effect sizes are minimal, this scholarship often 
reaches alarming conclusions about the erosion 
of civic life at the hands of ethnic diversity. 

However, in Western countries, homoge- 
neous and heterogeneous communities differ 
in systematic ways, which cautions against con- 
cluding that diversity per se has negative ef- 
fects. For one, heterogeneous communities are 
disproportionally nonwhite, economically dis- 
advantaged, and residentially unstable. Com- 
positional effects related to these differences 
largely account for the relationship between 
ethnic diversity and collective outcomes. For 
example, nonwhites and immigrants tend to re- 
port lower trust, and they are overrepresented 
in heterogeneous communities. Once analyses 
account for the fact that native whites, who are 
disproportionately represented in homogeneous 
communities, also score higher on prosocial 
indicators, negative associations with ethnic 
diversity are strongly reduced and even dis- 
appear. Similarly, economic hardship takes a toll 
on prosocial engagement, and diverse commu- 
nities have much higher rates of concentrated 
poverty (18). Overall, economic indicators are 
by far stronger predictors of collective out- 
comes than are ethnoracial indicators (3, 19). 

More generally, the consequences of ethnic 
diversity likely depend on the extent to which 
ethnicity constitutes one of many lines of dif- 
ferentiation or instead operates as an organiz- 
ing principle around which resources are 
distributed. It matters whether ethnicity inter- 
sects with other lines of division and, especial- 
ly, economic inequality. In their investigation 
of public goods provision, Baldwin and Huber 
found that economic inequality between 
groups—rather than ethnolinguistic or cul- 
tural differences—undermines welfare pro- 
vision (20). They speculate that this happens 
because richer, more powerful groups prioritize 
different public goods and exclude others from 
access. Therefore, resource asymmetries between 
ethnic groups, and not the multiplicity of ethnic 
groups per se, undermine collective efforts. 

Ethnic fractionalization has been and re- 
mains relatively low in Western Europe and 
North America compared with several coun- 
tries in Africa and Asia (Fig. 1B). The focus on 
Western countries is mostly driven by grow- 
ing immigration (Fig. 1A). Hence, to date, 
systematic ethnoracial differences between 
homogeneous and heterogeneous commu- 
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nities are an artifact of studying diversity in 
contexts such as North America and Europe, 
where heterogeneity is relatively low and homo- 
geneous communities are, by and large, homo- 
geneously native majority communities. 

It follows that although they use measures 
of heterogeneity and make claims about diver- 
sity, studies in Western countries are unable 
to attribute observed associations to hetero- 
geneity, as opposed to immigrant or minority 
share. As a result, studies of ethnic diversity 
rehash the findings of a long-standing liter- 
ature on how native majorities react to the 
growing presence of immigrants and minor- 
ities. This literature links the size and growth 
of immigrant and minority populations to 
perceived threat and greater hostility toward 
them. For example, survey and laboratory ex- 
periments found that U.S. whites who are ex- 
posed to information about the growing share 
of nonwhites express greater opposition to 
policies and parties seen to benefit nonwhites 
(21). Observed effects are theorized to stem 
from broad concerns about native majorities’ 
economic well-being, their cultural dominance, 
and their symbolic status within an intergroup 
hierarchy from which they derive social and 
psychological benefits (22). 

Diversity, as both a concept and measure, 
treats groups interchangeably; a community 
that is 80% white and 20% Black is as diverse 
as one that is 80% Black and 20% white and 
one that is 80% Latino and 20% Asian (/8). 
However, where there is differentiation, there 
is hierarchy: Native majorities, native minorities, 
and immigrants occupy different positions in 
the social order. Because intergroup dynamics 
tend to reproduce status and power asymmetries 
(23), the dynamics of similarly heterogeneous 
communities likely vary according to the specific 
groups represented and their relative sizes. 
Hierarchy raises another consideration: In 
heterogeneous contexts, we need to distinguish 
between benefits that accrue to single groups and 
those that extend to the whole collectivity (3). 

Taken together, these observations caution 
against making generic claims about the ef- 
fects of diversity. To ascertain the challenges 
and possibilities posed by diversity, we first 
need to disentangle its effects from those of 
inequality. This entails understanding the so- 
cial cleavages and asymmetries that govern 
intergroup relationships in diverse societies. 


Immigrants and native minorities in 
Western countries 


To what extent and in what domains have 
immigrants and native minorities achieved 
economic, political, and social membership 
in Western countries? 

In the United States, immigrants (primarily 
from Latin America and Asia) and native mi- 
norities (primarily Black Americans) con- 
tribute to present-day diversity. Regarding the 


4 September 2020 


experience of immigrants, scholars are split 
between those who contend that today’s im- 
migrants are on the same upward trajectory 
as earlier Europeans (24) and those who read, 
from some groups’ experiences, evidence of 
stalled or even downward mobility (25). Evi- 
dence of integration comes from the advances 
made by members of the second generation 
over their immigrant parents (26). However, 
longer-term views into the third generation or 
later reveal remarkable marital homogamy as 
well as network and residential segregation for 
some groups, such as Mexican Americans (27). 

The experience of Black Americans, the 
largest native minority group in the United 
States, challenges the expectation that full 
economic, political, and social membership 
necessarily await later-generation Americans. 
Black households have less wealth and lower 
incomes than do Asian or Latino households. 
And despite recent gains, Blacks are still less 
likely to marry whites and more likely to be 
residentially segregated from whites than 
are Asians or Latinos. Persistent, intergen- 
erational disadvantage among Blacks is a 
consequence of past institutional practices, 
including Jim Crow segregation and red- 
lining (28), present institutional practices 
such as mass incarceration, and contempo- 
rary discrimination in the labor market and 
other domains (29). 

In Europe, immigrants from Turkey, Africa, 
and other regions, including former colonies, 
contribute to diversity. Their prospects for in- 
tegration are sobering (30). Evidence of upward 
economic mobility is tempered by gaps in 
employment and earnings that may persist 
into later generations (37). A growing body of 
field experimental research uncovers discrim- 
ination against immigrants, especially Muslim 
immigrants and/or those of Arab origin, in 
formal markets such as those for employment 
and housing (32) and informal, everyday in- 
teractions (33, 34). Hostility toward certain 
immigrant groups is sometimes motivated by 
their observance and transmission of religious 
practices and cultural norms that are seen 
to conflict with liberal principles of gender 
equality and individual freedom (33, 35). These 
findings fuel the view that European societies 
are converging on a “discriminatory equilib- 
rium” in which discrimination toward some 
groups drives underinvestments in human 
capital (30) and furthers the reproduction of 
values and practices that stall integration in 
economic and other domains. 

The picture is not all negative, however. First, 
it is worth acknowledging that persistent, 
later-generation gaps in educational attain- 
ment, employment, and earnings coexist with 
substantial upward mobility, especially between 
the first and second generations (24). Second, 
legal status can go a long way toward securing 
economic mobility, as evidenced by the diverging 
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earnings trajectories of undocumented im- 
migrants and legal permanent residents in the 
United States as well as the rise in earnings 
induced by amnesty laws (26). When it comes to 
political incorporation, government efforts to 
promote citizenship, whether aimed directly 
at immigrants or at the community organiza- 
tions that serve them, boost naturalization and 
participation through material and symbolic 
channels—that is, by signaling immigrants’ 
suitability for inclusion (36). 

When such resources are not available or 
when discrimination is prevalent, attachment 
to a protective “ethnic core” may provide im- 
migrants and minorities one path to economic, 
political, and cultural mobility (27, 37). How- 
ever, insofar as enclaves reproduce segregation 
and contribute to discrimination by native 
majorities toward immigrants and minorities, 
they are a suboptimal and short-term reprieve 
to the challenges posed by diversity. A more 
robust solution for the successful integration 
of immigrants and minorities in multiethnic 
societies builds on the features of modern 
societies that facilitate cooperative encounters 
and shared interests across group boundaries. 


Toward a theory of prosociality in 
multiethnic societies 


The key to solidarity and cooperation in het- 
erogeneous communities is the extension of 
prosociality beyond close-knit networks and 
in-group boundaries to unknown, dissimilar 
others. The large-scale interdependence of life 
in modern societies requires that individuals 
follow universal norms of reciprocity and 
cooperation rather than rely on mutual ac- 
quaintanceship or group identification. The ob- 
servance of such norms is assured by the presence 
of strong coordinating institutions; for example, 
we rely on public transportation not because we 
know the bus driver or identify with them but 
because we trust that they will competently per- 
form the job that corresponds to their role (3). 
The type of prosociality that helps hetero- 
geneous communities function is different 
from the in-group solidarity that glues homo- 
geneous communities together. A large schol- 
arship has documented the parochial nature 
of human altruism, convincingly showing that 
in-group preferences are a staple of human 
behavior (38). From an evolutionary perspec- 
tive, parochial altruism emerged from the 
coevolution of intergroup favoritism and out- 
group hostility during periods of violent in- 
tergroup conflict (39). Although in-group 
favoritism may have served us well in small- 
scale societies, it cannot get us far in complex, 
large-scale societies characterized by hetero- 
geneity. For diverse societies to function, they 
must to some extent suppress members’ reli- 
ance on in-group identification as the primary 
basis for prosocial behavior (40). Prosocial be- 
havior in complex societies likely derives from 
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positive experiences in the context of strategic 
interactions, such as those in the workplace, 
rather than empathic identification (47). People 
in modern societies are often pushed outside 
the comfort zones of their familiar networks 
to constructively interact with unknown and 
dissimilar others. We have learned, froma 
rich literature on intergroup contact, that such 
interactions have the potential to reduce 
prejudice, especially under favorable condi- 
tions, including equal status, common goals, 
and lack of competition (42). Here, we discuss 
how social differentiation, a macrostructural 
feature of modern societies, may favor the 
emergence of generalized prosociality and the 
special role that market integration and eco- 
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nomic interdependence can play in facilitating 
productive intergroup interactions. 
Differentiation may be the key, not an ob- 
stacle, to social cohesion in modern societies 
because an increase in the dimensions of dif- 
ferentiation might bring about greater social 
integration. A greater number of identities 
and affiliations brings about distinct combina- 
tions that can foster even greater cooperation 
(8). This, however, occurs only when the lines 
of differentiation are cross-cutting, whereas 
division follows from consolidated lines of 
differentiation (Fig. 2). Ethnic heterogeneity 
can push societies toward either pole. On the 
one hand, when ethnic differences overlap with 
status and resource differences, in-group favoritism 


Fig. 1. Ratio of migrant stock and ethnoracial fractionalization by countries. (A) Ratio of international migrant 
stock (1990/2015). Europe and North America saw relatively large increases in national stocks of international 
migrants in the past two decades. International migrant stock refers to the percentage of foreign-born residents 

in a given year. Orange indicates higher ratios of migrant stock; teal indicates lower ratios of migrant stock. 

[Data source: United Nations Population Division] (B) Ethnoracial fractionalization (2013). Fractionalization is higher 
in sub-Saharan Africa and parts of Asia than in Europe or North America. Fractionalization corresponds to the 
probability that two randomly chosen residents belong to the same ethnoracial group. Darker colors represent higher 
ethnoracial fractionalization. [Data source: Historical Index of Ethnoracial Fractionalization] 
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Fig. 2. Social differentiation leads to greater integration when dimensions of differentiation are cross- 
cutting. (A to C) The top layers represent various group identities that individuals might have in modern societies 
(such as ethnicity, class, or sexuality), and the bottom layer describes the social network that emerges from shared 
membership in these groups. In (A), the two dimensions of differentiation are consolidated and thus bring about 
social fragmentation. In (B) and (C), the dimensions are cross-cutting, thus favoring social integration. As the number 
of cross-cutting dimensions increases [(comparing (C) with (B)], so does overall network integration. 


can operate more efficiently. But far from bind- 
ing people together (as it does in homogeneous 
societies), in-group favoritism would deepen 
inequality and division in heterogeneous ones. 
On the other hand, when heterogeneity along 
ethnic lines cross-cuts differences in terms of 
class, politics, and other dimensions, it both 
neutralizes in-group favoritism and deepens 
interdependence, fostering cohesion. 


Social differentiation 


Social differentiation refers to the multiplicity 
of identities and roles that individuals may 
acquire and inhabit in their day-to-day lives 
and often leads to greater individualization. 
Namely, people’s ability to choose, with relative 
freedom, their identities and group affiliations 
increases, and their profiles become distinctive. 
When lines of differentiation are cross-cutting, 
the process of differentiation and individu- 
alization sets the stage for broad-based cohe- 
sion through at least three pathways. 

The first is by facilitating interpersonal con- 
tact beyond close-knit, kinship ties and with 
others who are dissimilar in terms of some 
identities, including, most notably, ethnicity. 
Research supports the claim that generalized 
trust and other benefits flow from interactions 
outside dense networks, such as those based 
on kinship. Cross-societal comparisons have 
documented greater generalized trust and co- 
operation in an individualistic society such as 
the United States than in Japan, where moni- 
toring and sanctioning happen primarily within 
the confines of close, long-term relationships 
(4). According to Yamagishi’s emancipatory 
theory of trust, strong ties, which are typical 
of collectivist societies such as Japan, produce 
a sense of security within the group but 
prevent trust from developing beyond group 
boundaries. Similarly, people with strong fam- 
ily and group ties display lower levels of trust 
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toward generalized others in incentivized ex- 
periments. By contrast, people who are less 
embedded in family networks and those who 
have experienced uprooting events, such as 
divorce, are more likely to trust strangers, 
possibly because they have more opportuni- 
ties and incentives to engage in relationships 
with unknown others (5). More broadly, semi- 
nal work on social networks has exposed the 
limits of strong ties and close-knit social rela- 
tionships (43, 44). This work shines a positive 
light on weak ties and network positions of 
brokerage for their ability to connect parts of a 
social network that would be otherwise dis- 
connected, facilitating access to a broader range 
of information and opportunities. To quote 
Granovetter, “Weak ties, often denounced as 
generative of alienation...are here seen as 
indispensable to individuals’ opportunities 
and to their integration into communities; 
strong ties, breeding local cohesion, lead to 
overall fragmentation” [(43), p. 1378]. 

The second pathway through which social 
differentiation may foster cohesion is through 
identification, with or without direct interper- 
sonal contact. In laboratory studies, procedures 
that encourage identification with a common 
(or “superordinate”) identity have been shown 
to reduce prejudice across group boundaries 
(45). This is possible when cross-cutting affi- 
liations enable identification with a category 
that spans ethnic boundaries. An outstanding 
question is whether identification with a super- 
ordinate category can somehow achieve deeper 
trust and cooperation than can lower-level eth- 
nic identification, perhaps by “training” indi- 
viduals to be more flexible about categorization 
in general. If not, superordinate identification 
may be an imperfect solution that trades favor- 
itism toward one group for favoritism toward 
another, larger group. These aspects are ripe for 
further testing in field settings (46). 
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A third pathway consists in subverting 
humans’ deep-seated capacity to think (and 
act) in terms of in-group-out-group catego- 
ries. Category-based inconsistencies—for ex- 
ample, the Harvard-educated, first-generation 
Latina—inhibit the cognitive processes that 
compel us to frame encounters in “us versus 
them” terms, opening the door to more elab- 
orate cognitive processes in which an alter is 
more likely to be perceived as “an individual 
rather than an (oppositional) group member” 
[(40), p. 854]. The distinction between this 
pathway and one that hinges on a common 
identity is subtle: Category-based inconsisten- 
cies can subvert “us versus them” thinking 
even if we do not share identities or exper- 
iences with a target—that is, even if we are 
neither Ivy League-educated, nor Latino, nor 
the first in our family to attend college. 

Critically, the most effective way to secure 
multiethnic cohesion through this channel 
is not to promote a few minorities but rather 
to weaken the covariance between ethnic cat- 
egory membership and life chances writ large— 
that is, to cultivate a system in which a first- 
class education is equally accessible to whites 
and nonwhites, regardless of their family 
background. There is growing evidence that 
cross-cutting affiliations can mitigate bias 
against immigrants and minorities. Experi- 
mental evidence shows that U.S. Americans 
report greater willingness to admit immigrants 
who are highly educated or have high-status 
jobs (47). Relatedly, high socioeconomic sta- 
tus mitigates mistrust toward Blacks in a 
cooperative investment game (48), and signals 
of cultural integration mitigate bias toward 
Muslims in Germany (33). 

Taken together, the hypothesized pathways 
are consistent with a model of social cohesion 
in which cross-cutting differentiation, rather 
than social closure, is the unifying force. When 
social cleavages are not cross-cutting but in- 
stead consolidated—for example, when minori- 
ties and immigrants are systematically deprived 
of educational and employment opportunities 
and thereby relegated to the lower tiers of the 
social hierarchy—disadvantaged groups will con- 
tinue to be cast in a separate and marginalized 
social category and discriminated against. 


Economic interdependence 


Economic exchanges are the quintessential 
setting for meaningful, cooperative interac- 
tions between dissimilar others. This is partly 
because of the specific nature of economic 
transactions: They occur between parties who 
have different goods (or skills) to exchange 
and thereby bring together people who may 
not belong to the same social circles. Along 
these lines, workplace relationships tend to 
be less homophilious than relationships in 
other settings. Moreover, intergroup encount- 
ers in economic settings seem to be particularly 
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conducive to generalized prosociality. In a 
series of cross-cultural studies, Henrich and 
his colleagues uncovered less prosocial behav- 
ior in small-scale societies based on kinship 
networks than in market-integrated socie- 
ties in which strangers regularly engage in 
mutually beneficial transactions. In their 
words, “The more frequently people experi- 
ence market transactions, the more they will 
also experience abstract sharing principles 
concerning behaviors toward strangers” [(6), 
p. 76)]. Market integration not only fosters 
prosociality toward unknown others; it can 
also shift boundaries to include noncoeth- 
nics. In a nationwide field experiment in Italy, 
market integration explained variation in 
prosocial behavior toward both natives and 
immigrants (7). Similar effects are imputed to 
globalization, understood as greater world- 
wide connectedness (49). 

Workplaces, more than homes or neighbor- 
hoods, may be crucial for fostering the type of 
prosociality that holds modern societies to- 
gether. Minorities’ and immigrants’ positions 
in the productive system and their prospects 
for social mobility—including employment op- 
portunities in complementary sectors, and 
a legal regime that protects their rights as 
workers—are therefore important not only 
for their own material success but for society 
as a whole. The economic integration of mi- 
norities and immigrants also determines the 
extent to which they come to identify with 
mainstream society (50). 

Most economic exchanges—for example, 
hiring someone or renting an apartment from 
them—are strategic in nature, in the sense 
that a person’s behavior is affected by their 
expectations of the alter. These types of inter- 
action entail risk and uncertainty because 
people have to overcome difficulties related 
to coordination, lack of information, and mis- 
trust. Cooperative and prosocial behavior in 
these settings may still be affected by in-group 
favoritism but are also based on consider- 
ations that go beyond whether an ego likes or 
dislikes the alter, to encompass the alter’s 
trustworthiness, competence, and reputation 
(40). This calls for a deeper understanding 
of intergroup dynamics, and the institutional 
arrangements, that favor prosocial outcomes 
in the context of strategic interactions. Some 
field experimental work has made progress in 
this direction; for example, in a study of pub- 
lic goods provision in diverse Ugandan neigh- 
borhoods, Habyarimana and colleagues used 
behavioral games to disentangle the various 
motives and mechanisms that bring about 
collective action in multiethnic contexts (2). 
Although they did not find evidence of ethnic 
favoritism, they found that the reciprocity 
norms and sanctioning opportunities that fa- 
cilitate cooperation in risky interactions are 
stronger among coethnics than noncoethnics. 
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Market integration enhances opportuni- 
ties for productive interactions across group 
boundaries. Additionally, the strategic nature 
of economic exchanges elicits decision-making 
processes that go beyond in-group favoritism, 
therefore providing new venues for institu- 
tional intervention. 


Conclusion 


We can approach ethnic diversity through the 
lens of lost homogeneity. From this perspec- 
tive, we understand that members of the white 
majority tend to react negatively to the growth 
of immigrants and minorities in their com- 
munities. However, it would be premature to 
conclude that diversity or diversification per 
se are to blame for declining levels of trust and 
cooperation. In the Western European and 
North American context, diversity is synon- 
ymous with immigrant and minority share 
and economic disadvantage, and statistical 
attempts at disentangling their effects will 
not get us very far. 

Beyond questioning the effects of ethnic di- 
versity, scholars should develop a theory of 
social cohesion in multiethnic societies that 
considers intergroup dynamics, social cleav- 
ages, and asymmetries in resources and power. 
Crucial to this effort is understanding the con- 
ditions under which prosocial behavior extends 
beyond close-knit networks and the safe con- 
fines of the in-group. Here, we have high- 
lighted two features of modern societies, social 
differentiation and economic interdependence, 
that set the stage for generalized prosociality 
to develop. We argue that, in contrast with the 
in-group solidarity that glues homogeneous 
communities together, prosociality in heter- 
ogeneous societies likely derives from posi- 
tive experiences in the context of strategic 
interactions. Further research is needed on 
the mechanisms and institutional arrange- 
ments that foster this higher-level form of 
cooperation. 

The experience of immigrants and minori- 
ties is instructive regarding the conditions and 
institutions that facilitate integration and mo- 
bility in Western societies. Of primary im- 
portance are employment opportunities in 
mainstream labor markets, especially under 
conditions of economic expansion, along with 
legal and political inclusion. Regrettably, it 
is precisely these conditions that are in short 
supply in a historical moment characterized 
by the rise of right-wing movements, an eco- 
nomic recession induced by a global pandemic, 
and long-standing institutional practices, 
such as those of law enforcement, that deepen 
the divides between ethnoracial groups. Wheth- 
er societal adaptation to diversity moves toward 
integration or social division depends as much 
on microinteractions on the ground as on the 
economic and political institutions that gov- 
ern these processes. 
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Can democracy work for the poor? 


Rohini Pande 


Millions of the world’s poorest people now live in middle-income democracies that, in theory, could use their 
resources to end extreme poverty. However, citizens in those countries have not succeeded in using the vote to 
ensure adequate progressive redistribution. Interventions aiming to provide the economically vulnerable with 
needed resources must go beyond assisting them directly, they must also improve democratic institutions so that 
vulnerable populations themselves can push their representatives to implement redistributive policies. Here, | 
review the literature on such interventions and then consider the “democracy catch-22”: How can the poor secure 
greater democratic influence when the existing democratic playing field is tilted against them? 


primary goal of development policy is to 

end extreme poverty across the world by 

ensuring that all people have the basic 

resources they need to live. In recent 

decades, sustained economic growth in 
some countries in the developing world has, in 
fact, allowed hundreds of millions of people to 
escape destitution. However, this growth has 
also left hundreds of millions of the poorest of 
the poor in limbo. By 2015, a majority of the 
world’s extreme poor were living in countries 
that had become middle-income in terms of 
per capita gross national income (Fig. 1A). Al- 
though the vast majority of these poor people 
live in countries with democratically 
elected governments, they appear to 
lack the actual political power that 
would enable them to claim a fair, or 
even adequate, share of their coun- 
tries’ growth (1) (Fig. 1B). 

The international development com- 
munity targets considerable aid toward 
low-income countries with the aim of 
helping their citizens move out of extreme pov- 
erty. Equally poor citizens in middle-income 
countries do not benefit from the same support. 
This is in part because citizens of rich countries 
are reluctant to donate to countries with obvious 
concentrations of wealth (a political constraint 
that results in, for example, the country income 
criteria that underlie grant and loan program- 
ming by most bilateral or multilateral aid agen- 
cies), and so international aid is far less likely to 
target the extreme poor in these countries (2). 

Meanwhile, these poorest citizens of middle- 
income countries have largely failed to receive 
an adequate share of their countries’ increased 
wealth (3, 4). Compared with high-income 
countries, lower-income countries tax less and 
spend fewer resources—both in absolute terms 
and as a percentage of gross domestic product 
(5). High-income countries spend relatively 
more on social protection and less on clientel- 
istic policies, which exchange resources for 
political support, and, overall, a smaller share 
of their spending is lost as a result of corrup- 


Department of Economics, Yale University, New Haven, CT, USA. 
Email: rohini.pande@yale.edu 


Pande, Science 369, 1188-1192 (2020) 


tion (6). India, the world’s largest democracy 
and a country that transitioned from low- to 
middle-income status in 2009, exemplifies this 
contrast: It is home to both the largest number 
of extreme poor in the world and the third- 
largest number of billionaires. 

Given that citizens in general, and vulnera- 
ble citizens in particular, perceive progressive 
redistribution as an essential characteristic of 
democracy (Fig. 2), and given that poor coun- 
tries have larger vulnerable populations than 
rich ones do, reason would lead one to expect 
elected officials in poor countries to enact 
more redistributive policies than officials in 


“If the poor lack the means...to use their 


de jure power effectively, then they will fail 
to negotiate a fair share of...growth.” 


rich countries. And yet the opposite is true. So 
why are low- and middle-income democratic 
states less responsive to the preferences of their 
poorer citizens? One possible reason is a state’s 
limited capacity to deliver antipoverty programs. 
The state may want to redistribute resources 
yet finds itself hobbled by low revenues, lim- 
ited manpower, and, potentially, a lack of tech- 
nical know-how (7). This reasoning suggests 
that a critical role of development policy is to 
strengthen state capacity for program delivery. 
Duflo has argued for a problem-by-problem 
approach: Economists should embrace the role 
of “plumbers” and identify ways of improving 
the so-called “plumbing” of the state machinery 
(8). In recent years, researchers have made con- 
siderable progress in using field experiments to 
identify evidence-based policies that “improve 
the plumbing”—progress that was recognized 
by the 2019 Nobel Prize in Economics (9). 
But fixing the state’s plumbing need not im- 
ply progressive redistribution. In reality, a state 
consists of groups of actors with often mis- 
aligned interests—and the state machinery that 
has been built for service delivery may not, in 
practice, deliver for the poor. In a democratic 
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state, power is negotiated through democratic 
institutions. If the poor lack the means or in- 
formation necessary to use their de jure power 
effectively, then they will fail to negotiate a fair 
share of the proceeds of growth. 

Consider a concrete example that, in keep- 
ing with the theme of plumbing, focuses on 
water. Suppose a state has successfully con- 
nected its citizens to the water mains. There 
is enough water to meet citizens’ basic water 
needs but not enough to allow farmers to grow 
water-guzzling cash crops, or to allow rich city 
dwellers to install lawn sprinklers, and at the 
same time provide adequate water to the poor. 
To allocate water resources, elected politicians 
negotiate rules for pricing or rationing water, 
and bureaucrats implement these rules. In a 
benevolent state with adequate capacity, such 
actions can ensure that all citizens—rich and 
poor—get the water they need. 

In practice, however, both the rules and their 
implementation depend on how well the state 
accommodates interests of groups with vary- 
ing economic and political power. The eco- 
nomic elite may use financial resources to lobby 
politicians and distort rules in their favor. Even 
if that is not the case, intermediaries may divert 
resources. The politicians rely on city engineers 
to install water meters and on local village officials 
to regulate the sluices. But being closer 
to the front line affords these local ad- 
ministrators options: They can choose 
to use their information either to en- 
sure that water reaches its target or to 
collect bribes in return for diverting it 
(10). And so, when institutions and in- 
formation flows are weak, politicians 
may yield to powerful elites, and bu- 
reaucrats may line their pockets. Thus, even 
the best plumbing in the world can still leave 
the poor with taps that run dry. 

More broadly, ensuring progressive redis- 
tribution will take more than just improving 
state capacity or building visible infrastructure 
and delivery systems. It may also require chang- 
ing who controls the taps and sluices, chang- 
ing the incentives that person faces, and/or 
changing the distribution rules that person is 
supposed to implement. A growing body of em- 
pirical research on the political economy of de- 
velopment is beginning to provide strong causal 
evidence for what practical interventions can 
allow the poor to leverage democracy effectively. 
This leveraging, arguably, is what the demo- 
cratic state must ensure if we are to start closing 
the inequality gap in the developing world. 


Reforming the democratic state to better 
serve the poor 


The political voice afforded the extreme poor in 
democracies is typically constrained by eco- 
nomic and social disadvantage. In many settings, 
the economic elite can exploit their social con- 
nections and economic power to provide poor 
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voters with individual incentives to vote in a 
particular way (11, 12). The result is widespread 
vote buying and the pervasive use of clientel- 
istic policies (13). Recent research conducted 
in rural Philippines and India documents how 
such practices can depress progressive redis- 
tribution (/4-16). Lower literacy and less access 
to relevant political information can further 
weaken the ability of the poor to use their vote 
to hold politicians accountable (17). 

As a consequence, far too often the poor 
elect politicians and administrators who do 
not share their interests. And, as in our water 
example, these politicians and administrators 
themselves often delegate powers to frontline 
workers whose actions are vital to the welfare 
of citizens but are not easily monitored by their 
superiors (8). In economics, each of these situa- 
tions embodies a principal-agent problem, in 
which one group (principals) delegates policy 
implementation to another group (agents) in 
settings typified by incomplete information and 
varying incentives (19). When the principal has 
less information than the agent and the two 
parties face differing incentives, the agent can 
deviate from prescribed actions and, instead, 
make personally beneficial choices. 

Within the principal-agent framework, eval- 
uations of reforms intended to strengthen state 
capacity for delivery have yielded insights on how 
selection procedures, contractual form, and tech- 
nology influence whether the intended policy 
goal is achieved (20). However, much of this lit- 
erature assumes that the principal is someone 
within the state (typically, an elected representa- 


tive or senior bureaucrat) and, importantly, is act- 
ing in the interests of the poor. As I argue above, 
this presumption is often unfounded. Political 
agency models recognize this and adapt the 
principal-agent framework for democratic set- 
tings. Here, citizens are the ultimate principals, 
and elected and appointed state officials are agents 
with competing information and incentives (27). 
Improvements in state capacity need not align 
incentives across citizens and state officials. 
A direct implication is that, for the poor, ef- 
fective democracy requires more than regularly 
occurring elections. It also requires democratic 
institutions that successfully disengage an indi- 
vidual’s ability to freely exercise her electoral 
rights from her economic power. It requires that 
the poor have reason to enter the social com- 
pact of taxation in exchange for public services. 
And it requires complementary investments 
that directly enable and incentivize citizen 
participation in the day-to-day business of 
democratic governance (for instance, by lobby- 
ing and negotiating policy with elected repre- 
sentatives and campaigning on specific issues). 
A growing body of empirical political economy 
papers use naturally random events or field ex- 
periments to examine which reforms to demo- 
cratic institutions can achieve these goals. 
Below, I describe a set of studies that show how 
greater effective enfranchisement of the poor, 
transparency initiatives, and combining tax col- 
lection efforts with mechanisms to enable citizen 
engagement with state officials can improve the 
ability of the democratic state to deliver policies 
that the poor favor. Methodologically, these studies 
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Fig. 1. National per-capita income and democracy as experienced by the 
world’s extreme poor. (A) The sample consists of 163 countries. Extreme poverty 
data from World Bank's PovcalNet was used, measuring those consuming less 
than $1.90 a day (adjusted for inflation to 2011 and for purchasing power), with 
linear interpolation, when possible, for missing country-years. Country income 
classifications are from the World Bank. The sharp changes in 1998, 2007, and 
2010 are China and India entering lower-middle income and China entering 
upper-middle income, respectively. (B) The sample consists of 155 countries. 
Same poverty figures as in (A). Democracy data was sourced from the Varieties of 
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Democracy project (1). The clean elections index, an expert-scored annual index 
that ranges from 0 to 1, measures to what extent elections are free and fair. 
Benchmarks: The index over this time period for Vietnam has been relatively stable 
at about 0.5, and that for China has been stable at exactly 0. The egalitarian 
component index, also an expert-scored annual index that ranges from 0 to 1, 
measures to what extent the egalitarian principle of democracy is achieved, 
including civil liberties, universalistic welfare policies, and lack of particularistic 
goods. Benchmarks: The index over this time period for Vietnam has been 
relatively stable at ~0.7, and that for China has declined from ~0.5 to ~0.4. 


share common ground with the experimental 
literature on evidence-based policies: They rely 
on random variation in citizen exposure to dem- 
ocratic reforms to isolate the causal impact of 
reforms. Substantively, the findings from these 
studies are consistent with the predictions of 
political agency models, and they support using 
this framework to identify future reforms. Fi- 
nally, and importantly, they provide guidance 
for development policy by showing how piece- 
meal reforms that target the levers of power 
can strengthen democracy and get resources 
flowing to those who need them the most. 


Enfranchising the poor 


Universal suffrage has been effectively ubiq- 
uitous in democracies and near-democracies 
since 1980, with a handful of exceptions (J). But 
complex de facto voting procedures often con- 
strain the ability of the poor to exercise their vote. 
Using technology to make voting procedures more 
accessible to the less educated can strengthen 
their political voice. For example, Brazil historical- 
ly used a paper ballot system under which it was 
common for more than a quarter of the votes to 
be deemed invalid, that is, either blank or error- 
ridden. In 1998, electronic voting devices were 
introduced to municipalities above a certain pop- 
ulation threshold, and the electronic interface 
was universally adopted in 2002. By providing 
voters step-by-step guidance and introducing 
visual aids, this technology facilitated voting by 
less literate citizens. Fujiwara compared munic- 
ipalities just above and just below the pop- 
ulation threshold for eligibility for electronic 
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voting machines across years and found that 
access to electronic voting reduced the number 
of invalid votes by more than 10% and increased 
the election of left-wing legislators (22). This, 
in turn, was associated with a 34% increase 
in public health care spending over an 8-year 
period. More investments in pro-poor health 
spending led to a 6.8% (0.5 percentage point) 
decrease in the prevalence of low-weight births 
among mothers without primary schooling. 


Informing the poor and making political 
behavior transparent 


If information flows between voters, politi- 
cians, and parties are weak, however, suffrage 
and enfranchisement may not be sufficient to 
enable the poor to use their vote as an effective 
political voice. In such cases, it may be neces- 
sary to directly provide citizens with actionable 
information on government performance. 

Using data from Brazilian municipalities, 
Ferraz and Finan showed that the public re- 
lease of audit reports lowered reelection rates 
of mayors from more corrupt municipalities, 
with more pronounced effects observed in 
municipalities with a local radio station (23). 
Information about politician behavior can 
therefore improve voters’ ability to select poli- 
ticians on the basis of performance. 

Another approach is to encourage informa- 
tive voter campaigns. Bidwell et al. conducted 
a large-scale experiment during the 2012 par- 
liamentary elections in Sierra Leone, where they 
randomized citizen exposure to pre-election 
candidate debates hosted and screened by a 
third party (24). Watching debates increased 
political knowledge, improved voter-candidate 
alignment, and increased both the number of 
votes cast and vote shares of the best-performing 
candidates. Candidates, in turn, increased their 
campaign efforts in communities where debate 
screenings were held. Tracking a small sample 
of legislators, the authors found that legislators 
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chosen to feature in debates held twice as many 
constituency meetings and spent 2.5 times more 
discretionary public funds on development proj- 
ects than did legislators not featured in debates— 
a consequential outcome for voter welfare. 

While Bidwell et al. raise the possibility that 
voter engagement can incentivize politicians 
to engage in more progressive redistribution, 
Banerjee et al. investigate this question di- 
rectly and at scale, in the context of the city 
government of Delhi, where roughly a third of 
the population of 20 million live in slums (25). 
Delhi’s elected councilors legislate on how to 
redistribute state resources, and they also have 
access to discretionary funds that can be spent 
on infrastructure. Survey data show a drastic 
mismatch between councilor spending and 
citizen preferences: Although sanitation was 
a priority for most slum-dwellers, most money 
went toward road construction. Against this 
backdrop, arandom sample of councilors were 
informed 2 years prior to city elections that a 
leading newspaper would publish report cards 
on their performance just before the city elec- 
tion. The informed councilors subsequently 
moved their spending in a pro-poor direction, 
a move that was rewarded by political parties. 
Specifically, among those male councilors in- 
eligible to run for reelection in their own ward 
(owing to it being declared reserved for women), 
those who had undertaken more pro-poor spend- 
ing were more likely to receive a party ticket to 
run from a nonreserved ward, which translated 
into electoral rewards: Councilors who under- 
took more pro-poor spending received a higher 
voter share in the subsequent election. 

This experiment led parties to run better can- 
didates, so it served as evidence of the democracy- 
improving power of both transparency and the 
gender quotas that caused parties to drop the 
worse-performing male candidates. Arguably, 
in both the Indian and Brazilian cases, the use 
of mainstream media to render reports public 
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Fig. 2. What citizens consider to be an essential characteristic of democracy. The sample consists of 64 low- 
and middle-income countries (high-income countries have been excluded, as none of the world’s extreme poor live in 
them). Data on citizen sentiment to democracy from the Global Barometer Survey, Wave 2 (2011-2013) (41) were used. 
Although these surveys lack information on respondent income, analysis of harmonized household surveys indicates 
that 80% of the world's poor live in rural areas and that the majority lack more than a primary education (42). Thus, 
preferences of the full population are compared with those of a “vulnerable” population residing in rural areas and lacking 
more than primary education, as a proxy for poverty. The same country income classifications are used as in Fig. 1A. 
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was important for credibility. This is consistent 
with findings of a recent set of harmonized 
field experiments on voter information (26), 
which indicate that the only successful infor- 
mational interventions were those that pro- 
vided information in a public setting. 


Encouraging participation by the poor 


The state’s ability to tax is a prerequisite for 
providing services to the poor—so what will 
persuade citizens to enter this social compact? 
Weigel examines this question in the city of 
Kananga in the Democratic Republic of the 
Congo, which raised a minuscule $2 million per 
year in a province of 6 million people (27). Prior 
to the experiment, most citizens had never paid, 
nor been solicited for, formal taxes by the mod- 
ern Congolese state. Kananga’s government 
randomized property tax collection across its 
431 neighborhoods. In taxed neighborhoods with 
in-person collection, collectors went door-to-door 
registering households and collecting the ap- 
proximately $2 property tax. In control neigh- 
borhoods, citizens were left to voluntarily pay 
at the tax ministry. All citizens were encouraged 
to attend government-hosted town hall meetings, 
where officials and citizens discussed tax and 
public spending in Kananga, and to submit anon- 
ymous evaluations of the provincial government. 

The campaign increased the probability of 
visits from tax collectors by 81.5 percentage 
points (from 0.05% in control) and increased 
property tax compliance by 11.5 percentage 
points (from 0.001% in control). The property 
taxes collected during this campaign made 
up just under 5% of the provincial government’s 
total revenue, on par with local governments in 
more prosperous African countries. 

The campaign also yielded a participation 
dividend, increasing both town hall attendance 
and evaluation form submission. The citizens 
who were exposed to visits by tax collectors also 
positively evaluated the provincial government, 
citing more revenue, less leakage, and a greater 
responsibility to providing public goods. These 
effects on beliefs about the government sug- 
gest that enhancing citizen participation while 
expanding the tax net can instill in citizens the 
sense of an incipient social compact with the 
state. Along these lines, Olken reports 20 times 
higher citizen participation and greater citi- 
zen satisfaction when local village projects in 
Indonesia were chosen by direct plebiscite 
rather than representative village meetings (28). 
Villagers perceived projects selected by direct 
plebiscites to be fairer and more legitimate than 
projects of the same type selected by representa- 
tive meetings. 


The catch-22 of democratic reform 


In ademocracy, power is never as simple as “one 
person, one vote.” When the poor lack power— 
both to command sufficient resources for them- 
selves and to improve the democratic system 
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that is failing them—who has the power, or the 
incentive, to provide them with democratic in- 
fluence? This, arguably, is the catch-22 of dem- 
ocratic reform. 

One way to resolve the dilemma, inspired by 
the modernization hypothesis, is to focus on 
policies that affect growth. The main argu- 
ment of the modernization hypothesis is that 
the social transformations which accompany 
economic development create more politically 
aware citizens and, thereby, the conditions for 
democratic development (29). Yet, at the level 
of cross-country analysis, evidence supporting 
this hypothesis remains weak (30). 

Another way is to focus on identifying feasi- 
ble paths to reform within a given institutional 
setting (37). We can start by considering three 
groups of players who might propel reform: 
(i) the poor and disadvantaged who would ben- 
efit from it, (ii) the economic and political elite 
who control the levers of power from within 
the system, and (iii) actors, such as international 
development and human rights agencies, who 
influence reform from the outside. We can try to 
determine how these players might identify 
aligned incentives, create effective coalitions for 
change, and overcome resistance from those 
who stand to lose from reform. And, finally, we 
can identify which institutional structures will 
ensure effective reform implementation. 

A concrete example can help clarify how such 
a research agenda could play out. During the 
19th century, the United Kingdom (ike many 
Western countries) extended the vote from wealthy 
landowners to all males. Lizzeri and Persico 
argue that British elites broadened the franchise 
because it better aligned citizen and politician 
policy preferences, increased the electoral value 
of policies with diffuse benefits, and reduced the 
returns to politicians from clientelistic policies 
(32). The authors highlight the role of powerful 
insiders—a core set of politicians who popular- 
ized the view that extending the franchise was 
essential “to reduce the pervasiveness of patron- 
age and to coax the machinery of government 
to serve the public purpose.” What motivated 
these politicians, and how did they convince the 
broader population? Here, the authors point to 
the role of “philosophical radicals” who sym- 
pathized with the ideals of democracy but were 
also motivated by an awareness of the role of 
public goods, especially public health investments 
in improving citizen health and preventing out- 
breaks of diseases such as cholera. Increasing 
awareness of public health enabled the radicals 
to broaden support for universal male franchise. 
Thus, public health considerations prompted a 
set of powerful insiders to league with the poor, 
perhaps to safeguard their own well-being. What 
lessons does this historical example hold for 
democratic reform today? 

First, it shows that the identity of insiders 
matters. This issue is particularly salient in dis- 
cussions of identity politics, especially when 
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low population share or historic disadvantage 
limits the direct electoral clout of a group. 
Sometimes, reform will come from enlightened 
insiders, but external players can also play a role, 
as the United Nations did in the 1995 Beijing 
conference, pushing for gender quotas in political 
leadership. Since then, more than 100 countries 
have adopted some form of gender quota. And 
inclusive effects can cascade as the poor and 
disadvantaged become insiders themselves: 
Studies in India show that female representa- 
tion has been associated with greater invest- 
ments in drinking water in villages and gains 
in maternal and child health (33, 34). Women’s 
presence in government also influences subse- 
quent political behavior and citizen engagement 
with the state. For instance, Beaman et al. show 
that Indian villagers positively revise their beliefs 
about women’s ability to be effective leaders once 
(thanks to quotas) they are exposed to female 
leadership (35). Note that women are more 
likely to run in subsequent elections even in the 
absence of quotas. Political affirmative action 
for ethnic minorities is rarer but can similarly 
lead to better representation of their interests. 
For instance, in an earlier work, I show that 
mandated representation for lower castes and 
tribal groups in India was associated with in- 
creases in targeted redistribution (36). In this 
case, the introduction of quotas reflected the fact 
that a lower-caste citizen wrote India’s constitution. 

Second, an awareness of shared policy in- 
terests across lines of class (and possibly social 
identity) can generate coalitions for change. 
Troesken shows that health and life expect- 
ancy improved measurably among African 
Americans in the Jim Crow South—even as 
they suffered extreme discrimination in nearly 
every area of public life—because the elite 
realized that when it came to water and sani- 
tation, Black and white interests were inter- 
dependent (37). This is a rich area for study; 
today, climate breakdown and a global pan- 
demic are affecting the fortunes of both rich 
and poor in highly unequal societies. Educa- 
tional institutions and the media, both tradi- 
tional and social, could play an important role 
in increasing an awareness of aligned interests 
extending well beyond public health. 

Third, identifying ways of strengthening 
party structures and selection procedures within 
parties can yield important returns. The exten- 
sion of the voting franchise marked the begin- 
ning of allegiances to political parties in the 
United Kingdom, and competition between them 
was associated with their staking clear policy 
positions on progressive redistribution. Lower- 
income democracies often have weak party struc- 
tures. One example of how these parties, as 
insiders, can push for reform comes from Mexico. 
Using the example of Mexican states passing 
freedom of information acts, Berliner and Erlich 
show that political competition, by creating un- 
certainty over future political control, gave 
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Mexican political parties incentives to undertake 
transparency reforms that “serve as insurance 
mechanisms enabling ruling groups to protect 
their access to government information, and to 
preserve means of monitoring future incum- 
bents, in case they lose power” (38). The threat 
of one day being the underdog can push elite 
insiders to create a fairer system. 

Finally, poor citizens can themselves moti- 
vate reform. Although the extension of the 
voting franchise in the United Kingdom was 
peaceful, citizen-led political protests, both vio- 
lent and nonviolent, are an important method 
by which the poor and disadvantaged can di- 
rectly push for democratic reform. Nepal held 
its first democratic elections only in 2017, in 
the wake of a 10-year civil war. The main Maoist 
group backing the revolution entered democratic 
politics after the peace agreement was signed. 
This group pushed for the implementation of 
one of the world’s most progressive constitutions 
and played an important role in supporting the 
successful candidacy of ethnic minorities in 
the 2017 elections (39). Protests by minorities 
can also induce reform by “agenda seeding.” 
Using evidence from Black-led protests in the 
United States in the 1960s, Wasow demonstrates 
that activists can use methods such as disrup- 
tion to capture the attention of the media and 
overcome political asymmetries (40). 


Outlook 


The task, for researchers, is to expand the evi- 
dence base for exactly which democratic reforms 
are the most effective in providing the poor with 
a productive political voice, so that all people— 
the poor, insiders, and outsiders—can best take 
advantage of opportunities to effect change when 
they appear. A broader evidence base can also 
serve another purpose, as outsiders, such as in- 
ternational agencies seeking to end extreme 
poverty, face backlash at home for funding anti- 
poverty programs in countries where there are 
visible displays of wealth. Investing in interven- 
tions that have been shown to enhance dem- 
ocracy in developing countries may be more 
palatable than, say, sending cash. 

Clearly, making the democratic machinery 
work for the poor is a much more complex 
proposition than the technocratic task of “re- 
pairing the plumbing.” Building state capacity 
and visible infrastructure are necessary but not 
sufficient. We must also strengthen democratic 
power for the poor, particularly in lower income 
settings. I have focused on areas with good ex- 
perimental or quasi-experimental evidence on 
the effectiveness of reforms, and so I have not, 
for example, explored the potentially fertile ter- 
ritory of directly weakening the democratic in- 
fluence of the economic elite. The examples in 
this Review, however, show that imaginative 
and strategic coalition-building between policy 
actors with aligned incentives can bring about 
reform and empower the poor. 
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REVIEW 


Democracy’s backsliding in the international environment 


Susan D. Hyde 


If the end of the 20th century was defined by the relatively widespread acceptance of democracy, the second 
decade of the 21st century is marked by concerns about backsliding in new and established democracies 
alike and by a notable decline in foreign support for democracy around the world. As democracy’s global 
tailwinds shift to headwinds, scholars have an opportunity to better understand how experience with even 
superficial forms of democratic institutions across a diverse set of contexts influences citizen behavior when 
formal democratic institutions erode or disappear. This shift also provides the opportunity to examine whether 
citizen movements alone—absent external support—are sufficient to check newly emboldened autocrats. 


nitiated with the 1974 Carnation Revo- 

lution in Portugal, the most recent global 

wave of democratization has been filled 

with images of exuberant citizens march- 

ing to demand democracy from vicious 
tyrants, individuals taking sledgehammers 
to the Berlin Wall, and new voters proudly 
displaying indelibly inked fingers on election 
day. In many narratives, the picture is one of 
bottom-up demand for democratic governance. 
Although domestic factors, including citizen 
movements, are undeniably important, a com- 
plete picture of the worldwide diffusion of de- 
mocracy also includes the often-overlooked 
roles of international pressure on leaders to 
democratize and transnational support for 
prodemocracy movements (J-7). In the late 
1990s, Nobel Prize-winning economist Amartya 
Sen called the rise of democracy, as a “universal 
value,” the most important event of the 20th 
century. As he wrote, “[wJhile democracy is not 
yet universally practiced, nor indeed uniformly 
accepted, in the general climate of world opi- 
nion, democratic governance has now achieved 
the status of being taken to be generally right” 
(8). The empirical patterns are pronounced; as 
democracy grew in global prominence, nearly 
all sovereign states in the world introduced 
direct national elections and allowed multi- 
ple political parties to compete, if not win 
(9). At the peak of this trend, all but seven 
countries held national elections, 95% of which 
allowed for the possibility of multiparty com- 
petition (Fig. 1). Countries that held no na- 
tional elections between 2010 and 2018 were 
Brunei, China, Eritrea (postponed), Qatar (sched- 
uled, canceled), Saudi Arabia, Somalia (post- 
poned), South Sudan (postponed), and the 
United Arab Emirates. 

However, after decades of near-global dom- 
inance in which even powerful autocracies 
like China have claimed to be moving toward 
democracy, the second decade of the 21st cen- 
tury has been marked by concerns about demo- 
cratic backsliding—also called democratic erosion 
or autocratization. Such concerns are present 
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in many countries around the world including 
some long-standing democracies like India and 
the United States, which were previously thought 
to be well insulated from such change (10-/4). 
Democratic backsliding research is relatively 
new, and, although there are some definitional 
debates, it involves incremental changes away 
from representative democracy and toward 
authoritarianism. “Backsliding makes elections 
less competitive without entirely undermining 
the electoral mechanism; it restricts participa- 
tion without explicitly abolishing norms of 
universal franchise seen as constitutive of con- 
temporary democracy; and it loosens con- 
straints of accountability...” (72). 

Concerns about democratic backsliding 
are now prominent in dozens of countries: 
Populism, nationalist movements, declining 
trust in institutions, rising income inequal- 
ity, and the U.S. abandonment of its global 
leadership role threaten what had been reason- 
ably widespread popularity and legitimacy of 
democratic governance (15). Influential sub- 
sets of political parties around the world have 
embraced antidemocratic sentiment within 
their ranks (11, 15). In foreign policy, a decline 
in democracy promotion by powerful Western 
actors and overt challenges to international 
norms that privilege democracy and human 
rights present a related trend (16-18). 

As democracy’s global tailwinds shift to 
headwinds, what are the consequences? Al- 
though scholars have long acknowledged that 
international dimensions matter in the study 
of domestic politics (19), it is also the case that 
international norms and Western support 
for democracy have declined more in the past 
4 years than in the prior 40. This shock to the 
international environment, which may be 
lamentable from a normative perspective, also 
represents an opportunity for researchers to 
better understand international factors in the 
most recent wave of democratization (20) and 
how international pressure and transnational 
diffusion now condition democratic backslid- 
ing. A lack of sufficient variation over time in 
Western support for democracy since the late 
1970s has made studying its consequences 
more difficult. The precipitous decline in inter- 
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national support for democracy should provide 
new insights into several areas: (i) which leaders 
were more responsive to international con- 
straints on their repressive tendencies (21), 
Gi) in which countries international support 
for democracy played a less important role, 
and, more generally, (iii) which cases that had 
previously been interpreted as democratiza- 
tion were illusory to begin with. Scholars now 
have a singular opportunity to evaluate how 
experience with even superficial forms of dem- 
ocratic institutions across a diverse set of con- 
texts influence citizen behavior when those 
institutions erode or disappear, and they can 
also explore whether citizen movements alone, 
absent external support, are sufficient to check 
newly emboldened autocrats. 


The prodemocracy international environment 
and pseudodemocracy 


For decades, would-be autocrats inhabited an 
international system that clearly favored demo- 
cracy (2, 6, 17), including explicit democracy 
assistance programs, rhetorical support for 
democracy from the world’s most powerful 
leaders, changes in international organizations 
that committed member states to democracy 
(7), the idea that democracies made better 
international partners, and even a perception 
among foreign investors that democracies 
have less political risk [see (22) for a summary 
of this literature]. Western democracy promo- 
tion has always been partially undermined by 
geopolitical (23) or partisan objectives (24, 25) 
and open to critique on many dimensions (77, 26). 
Even so, the combination of direct democracy 
assistance and international norms favoring 
democracy operated in most regions of the 
world both at the elite level—constraining 
leaders who would otherwise engage in more 
overtly authoritarian behaviors (21, 27)—and at 
the citizen level—by aiding prodemocracy citi- 
zen movements and supporting civil society (28). 
Under this prodemocracy international 
environment, blatant antidemocratic behavior 
became more costly for autocratic leaders (3, 6), 
the consequences of transparently stealing 
elections increased (J, 3), and the costs of 
being excluded from the democratic club grew 
(3, 15). For citizens and opposition parties, the 
global prodemocracy turn carried with it not 
only increased constraints on authoritarian 
governments but also powerful ideas about 
universal rights for individuals, self-determination, 
and popular sovereignty (29). These ideas included 
arguments against the discriminatory idea that 
only some peoples were “fit for democracy” (8), 
access to networks of prodemocracy movements 
in other countries (28), concrete strategies of 
resistance (16, 30), and, in many cases, substan- 
tial Western support for civil society, including 
support for democracy-building activities (5). 
Suggestive empirical analyses have docu- 
mented both spatial and temporal clusters 
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in the spread of democracy (31, 32) and have 
provided strong evidence that trends in demo- 
cratic and autocratic transitions follow major 
changes in the international environment 
(2, 20, 33). Gleditsch and Ward, for example, 
have shown that the probability that an autoc- 
racy in a predominantly autocratic region 
will transition to democracy in a given year is 
just 0.015, but when the proportion of demo- 
cratic neighbors exceeds two-thirds and when 
at least one neighbor transitioned recently to 
democracy, the probability that a given autoc- 
racy will become democratic increases by a 
factor of 10 (37). Research on electoral revolu- 
tions and prodemocracy protest movements 
suggests that democracy activists learn trans- 
nationally and adapt strategies from each other, 
sometimes through internationally supported 
horizontal networks of activists (28, 34), and that 
mass movements aimed at bringing down dicta- 
tors are often regionally contagious (28, 34-36). 
Figure 2 illustrates regional clustering in the dif- 
fusion of formal democratic institutions (37, 38, 39). 
Research on the international diffusion of 
democracy has highlighted coercion, compe- 
tition, emulation, and learning as the central 
mechanisms by which democracy spreads (32). 
Recent studies have linked such processes to 
great power transitions at the system level, 
arguing that whether the most powerful country 
in the world is democratic or authoritarian 
causes global waves of regime transitions (al- 
though regimes that result from these transitions 
are not necessarily durable) (2, 40). Additionally, 
other studies have shown that countries were more 
likely to democratize when they had more eco- 
nomic and political linkages with Western coun- 
tries (47), were members of intergovernmental 
organizations with more democratic members 
, 7), or were geographically proximate to other 
democracies (31). Relatedly, democracy promo- 
tion and foreign aid conditioned on democratic 
reforms have been shown to have meaningful, 
positive effects on democratization (2, 6, 42). 
The prodemocracy international environment 
supported the global diffusion of democratic 
institutions and ideas, even if it did not always 
bring about genuine, durable democracy. Even 
partial political liberalization gave citizens more 
exposure to democratic practices, freedom of 
expression, individual rights, and experience 
with less overt repression than would have been 
the case absent a prodemocracy international 
environment—a counterfactual perspective 
that is in contrast to much of this literature’s 
characterization of hybrid regimes as failed 
democratic transitions [e.g., (41)]. Democracy 
promotion undoubtedly contributed to many 
cases of successful democratic transitions, but 
it also likely led to the diffusion of hybrid regime 
types, which combine aspects of democracy 
and authoritarianism. Many of these hybrid 
regimes could be characterized as pseudo- 
democracies, in which leaders were willing to 
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adopt democratic institutions—like multiparty 
elections, a pluralistic media environment, and 
universal suffrage—but had no intention of 
giving up power if they were to lose an election 
(3, 43), a form of exposure that pseudodemo- 
cratic leaders worked to avoid. The fact that 
some leaders have displayed more skill than 
others at maintaining plausible deniability that 
they were moving toward democracy has stood 
as a fundamental roadblock for researchers in 
measuring the extent of pseudodemocracy and 
its consequences (2). 


Emergent threats to democracy from above 
and below 


Current threats to democracy around the 
world come from both international and do- 
mestic sources, and even internal threats to 
democracy, like antidemocratic political parties, 
often have global ties. Although the Trump 
administration has accelerated the decline of 
US. support for democracy abroad, including 
its “admiration” of dictators and “disdain for 
traditional democratic allies” (75), the most 
recent trends undermining democracy promotion 
have been ongoing since 2001, amid subsequent 
US-led anti-terrorism foreign policy efforts. This 
includes declining enthusiasm for democracy 
promotion by a variety of Western actors (15), 
such as the European Union (44). These changes 
“have turned a world that was once relatively 
favorable to the spread of democratic norms into 
one where authoritarians can push back—and 
have learned to do so in innovative ways” (18). 
Recent research documents at least three ways 
in which the international environment has 
shifted away from democracy that have implica- 
tions for understanding democratic backsliding. 
First, although many forms of democracy 
assistance continue, the decline in more overt 
forms of democracy promotion and the rhetor- 


ical embrace of authoritarian leaders are likely 
to ease pressure on elites to appear plausibly 
democratic and simultaneously decrease sup- 
port for mass movements that favor democracy 
(16, 18). Prodemocracy citizen movements are 
now more likely to face leaders who are willing 
to engage in blatant targeting of political op- 
position and willing to commit violence against 
citizen demonstrations, and these movements 
are receiving less support from foreign allies. In 
Cambodia for example, members of the opposi- 
tion Cambodian National Rescue Party (CNRP) 
were arrested on false charges, harassed, and, 
in some cases, forced into exile (45). By 2017, 
Hun Sen’s government abandoned any pre- 
tense of democracy and banned the CNRP. As 
one prominent human rights commentator 
stated, “Prime Minister Hun Sen’s repression 
of the opposition, media, and activist groups 
has effectively turned the country’s democracy 
into a one-party state” (45). Up until at least 
2013, the Cambodian government was some- 
what responsive to Western pressure, allowing 
opposition parties to organize and permitting 
foreign democracy promotion programs to oper- 
ate. However, the increasing ability of Hun Sen’s 
government to rely on Chinese support instead of 
Western aid, combined with more-tepid Western 
democracy promotion efforts, have substantially 
diminished the Cambodian government's will- 
ingness to allow any political opposition (45). 
Second, mirroring patterns in the diffusion of 
democracy, authoritarian practices and strat- 
egies to undermine democratic institutions are 
more freely diffusing across borders by means 
of emulation and learning (46, 47). There is not 
yet much evidence that backsliding has been 
caused by overt coercive autocracy promotion 
by either China or Russia or by explicit rewards 
for autocratic partners (48). However, emulation 
of strategies of repression and the surveillance 
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Fig. 1. The global diffusion of national elections and formal political competition, 1950 to 2018. As democracy 
grew in global prominence over time, nearly all sovereign states in the world introduced direct national elections and 
allowed multiple political parties to compete. At the peak of this trend, all but seven countries held national elections, 
95% of which allowed for the possibility of multiparty competition. Data are from the National Elections Across 
Democracy and Autocracy (NELDA) project (9). Recent national elections are considered to be those from the current 
year or any of the 4 prior years. The potential for formal political competition is defined as in (9) and includes elections 
in which electoral opposition was allowed [the variable referred to as NELDA3 in the NELDA dataset (9)], a choice 
of candidates appeared on the ballot (NELDA4), and at least one opposition party was legal (NELDAS5). The full data and 
codebook are available at https://nelda.co/. Replication data and code are available from Harvard Dataverse (62). 
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of dissidents appears to be on the rise (47). For 
example, as the Arab Spring protests began 
to diffuse from Tunisia, dictators—fearing 
that similar protests would take hold in their 
own countries—were able to adapt strategies 
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to more effectively suppress prodemocracy 
protests (34). Although the protests in Tunisia 
and Egypt succeeded in overthrowing dicta- 
torial regimes, no other Arab Spring protest 
movements that followed were able to over- 


Fig. 2. Regional and temporal clusters in democracy and backsliding around the world. Cartograms 
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illustrate the regional and temporal diffusion of electoral democracy around the world in four snapshots of 
independent states: 1976 (the beginning of the most recent wave of democratization, initiated in Portugal), 1989 
(showing the regional clustering of transitions to democracy in Latin America), 1999 (showing the clusters of 
transitions in Eastern Europe and sub-Saharan Africa), and 2019 (in which democracy scores have decreased 
in many regions). The measure of democracy is the Electoral Democracy Index vl0 from the Varieties of 
Democracy project, which measures a minimalist definition of democracy and is available at www.v-dem.net/en/ 
data/data-version-10/. Replication data and code are available from Harvard Dataverse (62). Microstates and 
nonindependent territories are not shown, as defined by Weideman et al. (37) in the CShapes dataset. 


throw their governments (34). Recent research 
has documented Chinese and Russian strate- 
gic censorship and internet filtering by key- 
words that is aimed at minimizing the spread 
of information about protests and inoculating 
their governments from the diffusion of pro- 
tests to their own citizens (36). Another recent 
study has similarly documented regional diffu- 
sion of illiberal norms that restrict civil society, 
including cases in which restrictive new regu- 
latory language was copied verbatim from 
neighboring countries (46). Just as mass protests 
have proven regionally and globally conta- 
gious, inspiring citizens in other countries to 
take to the streets, some authoritarian elites 
have learned how to neutralize challenges to 
their rule from the experiences of their peers 
(34, 46, 47). Similarly, extremist political parties 
with authoritarian tendencies may emulate 
rhetoric and tactics aimed at hollowing out 
democracy in their countries, which represents 
a globalized threat to democracy from below. 
A third threat to democracy, led by Russia 
and China, aims to undermine the interna- 
tional norm of democracy and the legitimacy 
of democracy promotion efforts in part by 
privileging state security over individual rights 
(8). As the international norm of democracy 
erodes and associated international benefits 
decline, related prescriptive international 
norms—like the expectation that leaders will 
invite international election observers to scruti- 
nize their elections (3, 4)—may also weaken. This 
will result in fewer opportunities for international 
actors to effectively penalize stolen elections (3) 
and the loss of a periodic opportunity for inter- 
national support of democratic elections. Inter- 
national election observation has been linked to 
less election fraud (3) and more durable peace 
agreements after civil war (49). These emergent 
threats to democracy combine to create poten- 
tially divergent consequences for heads of state, 
other political elites, and citizens, as would-be 
autocrats seek to manage potential elite and 
mass challenges to their continued hold on 
power (50, 51) under shifting global constraints. 


Citizens and democratic backsliding in the 
international environment 


The recent, massive protests against democratic 
erosion in Hong Kong, initiated in 2019, stand 
out as a movement that would have been met 
with substantial international support just 
3 years earlier. Such support would likely have 
included diplomatic pressure on leaders in 
Hong Kong and China to moderate their stance 
and rhetorical and material support for protes- 
tors from democracies around the world. In- 
stead, external support for the prodemocracy 
movement was underwhelming. Even so, 
hundreds of thousands of citizens were willing 
to put their lives at risk to demand democracy 
and protest curtailments of individual free- 
doms. This stands as an example of how citizen 
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preferences for democracy can develop under 
limited democratization and how citizen-led 
protest movements can be powerful even 
without international support. However, the 
strong Chinese reaction, U.S. and British capi- 
tulation, and Hong Kong’s uncertain future 
raise questions about whether citizens alone 
can successfully defend democracy in the cur- 
rent international environment (/6). Recently, 
the 2020 Chinese security law quashed the 
1 July prodemocracy protests, which have marked 
the anniversary of the handover of political con- 
trol of Hong Kong to China since 1997 (52). 

Although the Hong Kong case is unparalleled 
in many ways, it illustrates how elite-driven 
democratic backsliding was met with surpris- 
ingly robust citizen-led resistance. Yet even 
with massive public resistance to backsliding, 
and likely because of declining global support 
for democracy, political elites ordered a bla- 
tantly and unapologetically brutal crackdown 
against protestors. What would have happened 
if international actors had quickly and force- 
fully supported the protestors’ cause? 

Democracy’s new global headwinds will 
likely have heterogeneous effects across types 
of democracy, including opportunities to learn 
about the consequences of pseudodemocracy 
for citizens. Some consequences of backsliding 
will depend on whether members of power- 
sharing coalitions also gain from autocratiza- 
tion (50) and whether business, military, and 
religious elites are content with changes to their 
statuses under democratic backsliding (72). For 
citizens, the global turn away from democracy 
could result in several divergent potential im- 
plications that future research should consider. 

One area for further study is how the col- 
lective action potential of citizen movements 
to defend democracy is influenced by varying 
forms of autocratization. Thus far, documented 
autocratization has been gradual and subtle 
rather than sudden and blatant, which may 
undermine citizens seeking to defend democ- 
racy against backsliding (14): “[E]lections are 
being hollowed out as autocracies find ways to 
control their results while sustaining a veneer 
of competitive balloting. Polls in which the 
outcome is shaped by coercion, fraud, gerry- 
mandering, or other manipulation are increas- 
ingly common. ...indicators for elections have 
declined at twice the rate of overall score totals...” 
(53). Similarly, Liihrmann and Lindberg, dis- 
cussing the challenge in measuring autocrat- 
ization, have suggested that “[rJuling elites 
shy away from sudden, drastic moves to auto- 
cracy and instead mimic democratic institu- 
tions while gradually eroding their functions” 
(14). Abrupt transitions to dictatorship, like 
military coups, remain rare (/4). 

Thus, a critical juncture for the potential 
power of citizen movements will be whether 
the competitive veneer of democracy is sus- 
tained or leaders drop the act entirely, includ- 
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ing the elimination of multiparty elections. 
Once introduced, regular elections provide an 
established focal point for collective action 
to enforce democracy when it is threatened 
(54, 55). Since the mid-1970s, some leaders 
who sought to manage the introduction of 
multiparty elections without losing their hold 
on power miscalculated in a manner that 
ultimately led to their downfall. Such downfalls 
have included either losing outright, in what 
Huntington has called “stunning” elections (20), 
or cheating to win, getting caught, and facing 
postelection protests and electoral revolutions, 
often combined with international pressure to 
step down or hold new elections (28, 56). 

Figure 3 shows the total number of elections 
held annually over time, which fluctuates 
naturally from year to year. But after steady 
increases throughout the post-World War IT 
period, the number of countries holding elec- 
tions hit a plateau and may be in decline. 
Although 2016 matched the highest number 
of elections ever held, in 2017 the number of 
elections globally dropped to a number not 
seen since the late 1980s. This could be a 
result of idiosyncratic fluctuations that result 
from differences in the length of electoral cy- 
cles and parliamentary systems with variable 
election timing, but it could also be related 
to backsliding trends. As partial evidence of 
backsliding, the lower line in Fig. 3 also shows 
how many elections each year are character- 
ized by government harassment of opposition 
parties and/or media bias in favor of the in- 
cumbent, both of which have increased since 
the mid-2000s. 

In 2020, the global pandemic gave many 
governments a legitimate public health reason 
to postpone or suspend elections, and these 
suspensions may serve as effective cover for 
some leaders to more permanently eliminate 
elections, which have represented a power- 


ful risk to leaders’ hold on power. Although 
citizens could use the cancellation of elections 
as a onetime focal point to coordinate protest, 
the periodic nature of elections—even in electoral 
authoritarian regimes—facilitates a regular and 
collectively anticipated opportunity for citizen 
coordination that would be hard to replace if 
elections were eliminated. 

The second area in which changes in the 
international environment provide new re- 
search opportunities relates to where citizen 
support for democracy will prove more dura- 
ble in the face of backsliding. Is exposure to 
even superficial democratic institutions suffi- 
cient to generate either elite-led backlash and/ 
or mass protests against democratic erosion? 
Or are such reactions much more likely in 
countries with long-term experience with democ- 
racy? Recent research has suggested that threats 
to democracy can, on average, cause increased 
public support for democracy (57), but it is not 
yet clear whether all forms of backsliding are 
similarly likely to trigger countermobilization. 
Democracy protests and citizen-led mass mobili- 
zation remain powerful forces (30, 58, 59). As 
backsliding spread and international support 
for democracy declined, citizens in some coun- 
tries took to the streets to demand political 
freedom, including the massive protests in 
Algeria, Bolivia, and Sudan (17). Sudan’s recent 
popular uprising unseated a dictator who had 
withstood decades of international pressure, 
including an International Criminal Court 
indictment. Indeed, some of the notable cases 
of democratic progress amidst the more gen- 
eral trend of backsliding in 2019 were linked to 
mass protests. It may not be easy for elites to 
put the genie back in the bottle. But whether 
citizen movements alone, absent external sup- 
port, will be sufficient to check newly embold- 
ened autocrats will not be apparent for years 
to come. 
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Fig. 3. Total national elections per year with increasing trends in opposition harassment and/or media bias, 
1945 to 2018. There has been a recent plateau or ceiling in the total number of elections held each year from 1945 
to 2018 (top lines). At the same time, there has also been a recent increase in elections characterized by 
government harassment of their political opposition and/or media bias in favor of the incumbent candidate or party, 
with a notable uptick since 2006. Smoothed trendlines overlay the annual counts. Data are from the NELDA project 
(9). Government harassment of opposition is measured as NELDA15, and media bias in favor of the incumbent 

is measured as NELDAI6. The full data and codebook are available at https://nelda.co/, and code and data used to 
produce the figure are available from Harvard Dataverse (62). lowess, locally weighted scatterplot smoothing. 
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Of course, it is also possible that in many 
countries, citizens will react to democratic 
backsliding with a shrug, perhaps even cheer- 
ing the demise of political processes that did 
not serve their interests, as some have argued 
is the case in Russia under Vladimir Putin or 
the United States under Donald Trump (60, 67). 
It is possible that citizen acquiescence to back- 
sliding is more likely in pseudodemocracies, 
but more research is necessary to address this 
and related questions. 


Outlook 


Contemporary empirical research on democ- 
racy is, by necessity, limited to the range of 
observed behavior. As researchers and practi- 
tioners turn toward an uncertain future of 
geopolitical competition, they should engage 
critically with research conducted during periods 
of democracy dominance. Some findings will 
prove durable, whereas others will not. 

Changing geopolitics mean not just a de- 
cline in Western support for democracy, but 
also probable increases in interference by 
Russia and China aimed at either undermining 
democracy or reducing the accountability of 
political elites to citizens. Countries that have 
previously felt pressure to adopt democratic 
institutions will not be simply left to their own 
devices, free of any foreign interference. As the 
United States has stepped back from a global 
leadership role, China and Russia have led 
efforts to undermine democratic institutions 
around the world with little resistance from 
what used to be democracy’s more prominent 
defenders. Shifts away from democracy as a 
“universal value” (8) are likely to be consequen- 
tial for research at the national, cross-national, 
and individual levels. Without ignoring the 
potential normative implications of the global 
turn from democracy, researchers have an op- 
portunity to provide a more complete under- 
standing of the international dimensions of 
democracy and dictatorship. 
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False equivalencies: Online activism from left to right 


Deen Freelon’*, Alice Marwick2*, Daniel Kreiss’* 


Digital media are critical for contemporary activism—even low-effort “clicktivism’” is politically consequential 
and contributes to offline participation. We argue that in the United States and throughout the industrialized 
West, left- and right-wing activists use digital and legacy media differently to achieve political goals. Although 
left-wing actors operate primarily through “hashtag activism” and offline protest, right-wing activists 
manipulate legacy media, migrate to alternative platforms, and work strategically with partisan media to spread 
their messages. Although scholarship suggests that the right has embraced strategic disinformation and 
conspiracy theories more than the left, more research is needed to reveal the magnitude and character of 
left-wing disinformation. Such ideological asymmetries between left- and right-wing activism hold critical 
implications for democratic practice, social media governance, and the interdisciplinary study of digital politics. 


ctivism is a fixture of contemporary 

politics, both democratic and otherwise. 

At its core is the drive to enact or pre- 

vent political, cultural, and/or social 

changes by a range of means. Although 
nonelite citizens have advanced activist claims 
against the powers that be for millennia (J), 
in the 21st century, digital media offer un- 
precedented tools for activists around the 
world to help realize their sociopolitical 
visions. In this review, which focuses on the 
United States but also incorporates evidence 
from other countries, we argue that both the 
ideological left and right use the ad- 
ditional channels and low-cost parti- 
cipation afforded by digital media to 
reach potentially sympathetic publics. 
However, despite some similarities, re- 
cent research indicates that left and 
right differ sharply in how they use di- 
gital media. Whereas the left generally 
combines on- and offline protest ac- 
tions with transmedia branding, an ap- 
proach known as “hashtag activism” 
(2), the right tends to eschew offline protest 
(notwithstanding a few prominent excep- 
tions), preferring instead a combination of 
“trolling” or manipulating mainstream me- 
dia, protest against and even strategic exit 
from platforms owned by “Big Tech,” and 
cooperation with ideologically friendly me- 
dia outlets. Moreover, available evidence 
suggests that the right has invested far 
more than the left in disinformation and 
conspiracy theories as core components of 
its activist repertoire, although a lack of sim- 
ilar research on the left makes comparisons 
difficult. These asymmetric trends hold im- 
portant implications both for scholarship 
and for democratic practice. 
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Low cost, high benefit: Clicktivism and 
political participation 

Since the start of social media’s diffusion 
throughout Western societies, concerns have 
been raised about its efficacy for political par- 
ticipation. One prominent early objection was 
that “slacktivism” or “clicktivism,” low-cost sym- 
bolic actions such as sharing, “liking,” changing 
one’s profile image, and generally posting ac- 
tivist content on social media, projects an 
impression of efficacy without actually being 
effective (3). The two assumptions underlying 
this objection are, first, that such digitally 


“Digital political activities—including 


low-cost ones—are a 


complement to, not a substitute for, 
their offline counterparts.” 


mediated symbolic behaviors are generally not 
consequential in and of themselves and, sec- 
ond, that they substitute for more impactful 
actions such as voting or offline protest. Later, 
we will turn to recent research on how digital 
activism can be highly impactful on its own, 
contributing to phenomena such as disinfor- 
mation. Meanwhile, empirical research has 
consistently failed to support the proposition 
that digital action substitutes for offline action 
(4-6). That is, people who are strongly inte- 
rested in politics tend to express that interest 
through both online and offline behaviors. Dig- 
ital political activities-including low-cost ones- 
are a complement to, not a substitute for, their 
offline counterparts. Inversely, those who are 
uninterested in politics tend to avoid it both 
online and offline. Specifically, Lane e¢ al. 
found that sharing information about politics 
on social media predicted offline political ac- 
tivities such as attending political meetings, 
contacting public officials, and donating money 
to political campaigns (4). de Ziniga et al. (5) 
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found that the use of social media to address 
community problems, which they call “social 
media social capital,” predicted the propensity 
to engage in similar activities offline. And a meta- 
analysis of 106 survey studies of young people's 
civic and political use of digital media in >35 
countries found that the use of digital media for 
political purposes was positively correlated with 
offline political and civic engagement (6). 

The unanimity of the literature on this point 
has led some to declare that the clicktivism 
debate is conclusively settled (7). However, 
this conclusion is premature given several im- 
portant questions that lack solid empirical 
answers. One of the most pressing begins 
with the observation that political engage- 
ment is issue specific: An individual can be 
engaged with one or more issues and disen- 
gaged from others. The clicktivism question 
then evolves from whether low-cost digital ac- 
tivities exhaust one’s engagement with politics 
in general to whether such activities may do so 
for specific issues that lie beyond the person’s 
usual interests. For example, whereas liking, 
sharing, and posting memes about environ- 
mental topics may be just one of many ways an 
environmentalist engages with her pet issue, 
it may be the only way she does so for, say, 
Black Lives Matter when that movement is 
trending nationally on Twitter. The pat- 
tern of punctuated equilibrium that typ- 
ifies social movement activity on social 
media implies that some variant of this 
will be true at least some of the time. To 
continue with the Black Lives Matter 
example, a study that tracked related 
tweets over a 1-year period overlapping 
the movement's birth showed a few sharp 
peaks of interest (most prominently in 
August, November, and December of 2014 
and in April and May of 2015) separated 
by lengthy periods of much lower activity 
(Fig. 1) (8). This is typical of such movements’ 
social media activity and indeed of social me- 
dia in general (9). 

Logically, the bursts of attention that create 
such peaks must be provided by people (or bots, 
a non-negligible possibility) who engage for a 
short time and then depart, leaving a com- 
mitted core of activists to sustain the baseline 
conversation. Whether such participation is 
considered clicktivism is more a question of 
philosophy than empiricism. On the one hand, 
the degree of individual commitment is un- 
doubtedly low, but on the other, the aggregate 
crests of attention generated by thousands or 
millions of such actions can catapult a protest 
movement from obscurity to international pro- 
minence (J0). As Freelon et al. document (8), 
grassroots attention on social media played 
a substantial role in spreading the initial pub- 
lic awareness of Black Lives Matter’s existence 
and goals, which was an essential precursor 
to its widespread acceptance by the American 


lofs 


public in mid-2020 (11). Our hypothetical en- 
vironmentalist may not have engaged with 
Black Lives Matter at all if low-cost online 
actions were unavailable; thus, rather than 
substituting for higher-cost street-level activ- 
ism, online actions broaden symbolic support 
for movements (12). 

Our main arguments on clicktivism can be 
summarized thusly: There is a continuum of 
online activist participation ranging from post- 
ing and liking content to high-level decision- 
making as a full-time activist. Even more, as 
the remainder of this review clearly reveals 
through the lens of recent empirical research, 
low-cost digital activities can sum to sub- 
stantial effects ranging from publicizing 
movements for mass audiences to circulating 
disinformation that undermines democratic 
deliberation and processes. A number of 
American activist movements have substan- 
tially furthered their goals through digital 
means over the past decade, including Oc- 
cupy Wall Street, Black Lives Matter, #MeToo/ 
#TimesUp, far-right anti-immigration advo- 
cates, and the mens’ rights movement. Sim- 
ilar results have been observed outside of the 
United States (10, 12, 13). To add a right-wing 
example to the Black Lives Matter case de- 
tailed above, Benkler et al. explain how far- 
right media, activists, and social media users 
successfully introduced the term “globalist,” 
an anti-Semitic dog whistle, into the journal- 
istic mainstream (/4). This effort began with 
white nationalist sites such as VDARE and 
continued through Breitbart (a far-right site 
that avoids explicit white nationalism), Fox 
News, and the Trump administration after the 
2016 election, finally ending up as a synonym 
for “neoconservative” in The New York Times. 
The online-only media outlets at the begin- 
ning of this chain rely heavily on social media 
sharing to boost their messages (75). In the 
United States, this is the main way they attract 
the attention of Fox News, which is more di- 
rectly networked with more traditional media 
outlets and the Trump administration. Over- 
all, this example demonstrates how far-right 
actors can insert their preferred terminology 
and ideas into more “respectable” outlets that 
would otherwise try to avoid such associa- 
tions. Other studies have demonstrated that 
sites such as Breitbart (and their European 
counterparts) serve similar “bridging” func- 
tions between far-right and legacy media 
(6, 17). In these and other ways, slacktivism 
has been a consequential component of con- 
temporary social movements and will likely 
continue to be so in the future. 

The empirical record has very little to say 
on the question of ideological asymmetries in 
slacktivism, mostly because left-wing protests 
have been studied far more than right-wing 
protests (78). Based on what we know about 
how most areas of life typically work online, 
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we might expect that right-wing actors would 
use online and offline means to pursue their 
interests similarly to the way that those on 
the left do. One survey-based study found that 
for American respondents with low political 
interest, “easy political behaviors [such as 
liking and commenting on social media] can 
be gateway behaviors to more significant po- 
litical activities,” but that ideology was not a 
significant predictor of this tendency (19). 


Left- and right-wing digital strategies 
and ecosystems 


One of digital media’s most important contrib- 
utions to activism is how they have opened 
new pathways to reach target audiences. Be- 
fore the digital age, protesters who wished to 
project their messages nationally or interna- 
tionally had only one viable option: attracting 
the news media’s attention, which they usually 
did through street protests. Mailing lists and 
alternative media extended their reach only 
moderately. Today, digital media afford activ- 
ists across the political spectrum two general 
methods of promoting their causes. The first is 
to circumvent the news media entirely and 
appeal directly to digital platform users. This 
method offers the advantage of placing mes- 
sage control mostly in the hands of activists 
and sympathetic parties but by definition mostly 
reaches people who are already platform users. 
Second, activists use digital platforms to attract 
journalists’ attention (because most use social 
media extensively as a gauge of public opin- 
ion and as a source of stories) (20) in the hopes 
that they will cover their movement. The ad- 
vantage here is that news outlets can reach in- 
dividuals outside of the digital spheres within 
which activists operate, as well as those who 
are not digitally active at all, but may also alter 
activist messages in ways that are not always 
favorable to movements (27). These two methods 
are not mutually exclusive; many of the best- 
known activist movements in recent years have 
used both (2, 8, 22). 

Although activists on both sides use digital 
media to reach audiences directly and indi- 
rectly through the news media, the left and 
the right have each evolved their own dis- 
tinct style of doing so. The dominant style on 
the left has been labeled hashtag activism 
(2, 23, 24) and bears three main distinguish- 
ing characteristics. The first and foremost of 
these is the creation of a declarative hashtag 
to serve as the movement's unifying slogan; e.g., 
#BlackLivesMatter, #MeToo, and #Fightfor15 
became shorthand for a host of demands and 
priorities. The limited amount of attention 
that most people decide to allocate to news in 
general and activist appeals in particular gua- 
rantees that only a few protest hashtags will 
attain national or international prominence. 
Such hashtags often come to the public’s at- 
tention through news coverage of shocking 
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and disruptive events, such as Michael Brown’s 
death at the hands of police officer Darren 
Wilson in Ferguson, MO (#BlackLivesMatter), 
the disclosure of Harvey Weinstein’s decades- 
long history of sexual predation (#MeToo), and 
a series of American fast-food worker strikes 
in 2012-2013 (#Fightfor15). Second, such hash- 
tags are buoyed by the widespread engage- 
ment of nonelites, ordinary citizens who relate 
to the hashtag’s core message or simply want to 
declare their support. This is what causes them 
to “trend” on social media and thereby trigger 
the third element: attention and support from 
elite third parties. Most prominent among these 
are mainstream news outlets, which are often 
the first elites to publicize activist hashtags. Others 
include celebrities, businesses, and politicians, 
all of whom hold disproportionate power to 
direct attention to movements. Examples in- 
clude hip-hop artists Talib Kweli and Common 
(#BlackLivesMatter), ice cream company Ben & 
Jerry’s (#BlackLivesMatter), actress Alyssa 
Milano (#MeToo), and Senator Bernie Sanders 
(#Fightforl15). Although much hashtag activism 
research is U.S. focused, the phenomenon has 
also been observed in countries such as Argentina 
(25), Bangladesh (26), France (27), and India (27). 

The right engages with these dual pathways 
very differently. Several fundamental differ- 
ences with the left explain this. First, American 
conservatives’ mistrust of the mainstream 
news media has been intensifying for decades 
(28, 29), a pattern that seems to be common 
on the right across Europe and India as well 
(30-32). The sense that traditional news out- 
lets are irredeemably biased against con- 
servatives is one of the driving factors in the 
establishment of right-wing media ecosystems, 
the roots of which in the United States reach 
back at least to the 1930s (33). Second, con- 
servatives have more recently developed an 
analogous belief that “Big Tech,” a pejorative 
term for the companies that produce and main- 
tain the internet’s most widely used commu- 
nication platforms and hardware, including 
Facebook, Google, Twitter, Apple, and Amazon, 
is also biased against them (34). These two 
beliefs have led the right to interact with the 
news media and tech platforms in more radi- 
cally oppositional ways than the left despite 
the latter’s critiques of those institutions. Dis- 
taste for (and being deplatformed from) Big 
Tech has prompted some far-right users to de- 
camp to platforms more accepting of their 
politics, including Telegram, Gab, and Voat 
(35). Third, since 2016, the center-right’s pres- 
ence on social media has diminished substan- 
tially (14, 36, 37), leaving the far right as the 
dominant conservative presence. Together, 
these short- and long-term trends have shifted 
the right into a world apart from the left and 
center, and its activist tactics reflect that re- 
ality. Figure 2 quantifies this phenomenon 
by depicting the percentages of “fragmented” 
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Fig. 1. Daily tweets about police violence and Black Lives Matter, June 2014 to May 2015. Reproduced with permission from (8). See (58) for the data and 


code used in creating this figure. 


users that retweeted media outlets along the 
ideological spectrum in 2017. Outlets with pre- 
dominantly far-right audiences attracted nearly 
four times more fragmented users (those that 
disproportionately retweeted within one parti- 
tion) than the second most fragmented partition. 

Conservative mistrust of the mainstream 
media has inspired two distinctive tactics for 
interacting with two kinds of media outlets. 
Those that lack an explicitly conservative out- 
look often find themselves targeted by media 
manipulation, an umbrella term that refers 
to a repertoire of bad-faith tactics intended to 
attract journalistic attention (22). One of the 
most prominent of these, known as “trading 
up the chain,” involves planting a sensation- 
alistic hoax, conspiracy theory, or extreme 
viewpoint in a small or local news outlet that 
may not fact-check it (22, 38). This story may 
then be repeated by larger outlets, either be- 
cause of its content or because an elite (such 
as Donald Trump) has endorsed it. Whether 
the underlying claim is presented as true or 
debunked, the goal of spreading it further is 
fulfilled. By contrast, right-wing activists’ in- 
teractions with ideologically friendly outlets 
are understandably far less contentious. What 
Benkler et al. have called the American “right- 
wing media ecosystem” is a densely interlinked 
region of the media network that stands far 
apart from other media in terms of digital, 
professional, and ideological connections 
(14, 16). Its reach on social media platforms is 
extensive, in most cases larger than its left- 
wing equivalent (14). The ostensibly more 
journalistic outlets in this network, such as 
Fox News and the Daily Caller, regularly le- 
gitimize content surfaced by the more radical 
outlets, which include Infowars, Gateway 
Pundit, and Breitbart. The right-wing media 
ecosystem’s favored topics during the Trump 
administration have prominently included un- 


Freelon et al., Science 369, 1197-1201 (2020) 


compromising opposition to non-Western im- 
migration, the evils of the so-called “deep 
state,” and attacks on the legitimacy of the 
Mueller investigation (14). 

Two other tactics used disproportionately 
by right-wing actors are specific to social plat- 
forms. The first is the strategic manipulation 
of platform algorithms to increase attention 
to desired messages. Much as the gatekeep- 
ing function of legacy journalism shaped the 
norms, practices, and patterns of news cov- 
erage of social movements, social platforms’ 
emphasis on user engagement affects what 
information is displayed to individual users, 
for example, by giving greater reach to emo- 
tionally charged content, videos, and visual 
graphics over text (39). Thus, successful online 
activists must understand how social plat- 
forms algorithmically sort content to ensure 
that their own is given priority. Although 
both left- and right-wing actors engage in such 
tactics, preliminary evidence suggests that the 
right has been more successful. For instance, 
platforms such as YouTube have recommen- 
ded increasingly extreme far-right content to 
viewers of more moderate right-wing channels 
to maximize user engagement with the site 
(40). Similar techniques include optimizing 
search engine keywords so that interested 
parties will more readily find ideologically 
biased results (47) and the use of fake accounts 
and bots to imply widespread consensus on 
social media (42). Because journalists often 
rely on engagement metrics such as Twitter's 
“Trending Topics” to determine which stories 
should be covered and how they should be 
framed, successful algorithmic manipulation 
may help to set legacy media agendas (22). 

Second, in response to deplatforming, shadow 
banning, and content moderation by Big Tech, 
some right-wing actors have migrated to “alt- 
tech” equivalents that offer more permissive 


4 September 2020 


moderation. These include social media sites 
dedicated to right-wing communities, such 
as 4chan and 8chan, the Twitter alternatives 
Parler and Gab, and the YouTube alternative 
BitChute, as well as more ideologically neutral 
platforms such as Discord and Telegram (35). 
Although alt-tech platforms are much smaller 
than their mainstream counterparts, they al- 
low partisan and fringe communities to exist 
without opposition from alternative viewpoints. 
Studies have demonstrated a high prevalence 
of hate speech on 4chan (43), Gab (44), and 
BitChute (45), which is typically moderated 
on more mainstream social platforms. These 
spaces allow more extreme viewpoints to 
thrive, whereas mainstream social media pri- 
marily host less extreme content designed to 
reach wider audiences (22). 

The most relevant implications of the dif- 
ferences between how left- and right-wing ac- 
tivist networks reach their respective audiences 
derive from their very different relationships 
with the platforms they use. The left largely 
engages directly with traditional and social 
media, using them as primary communication 
venues to develop and distribute activist mes- 
sages. These outlets and platforms present 
themselves as what Cass Sunstein called “gen- 
eral interest intermediaries” (46), information 
environments that admit a wide range of per- 
spectives. Consequently, left-wing ideas tend 
to connect with individuals and institutions 
along a much broader range of the ideological 
spectrum than the right, including much of 
the center (74). By contrast, the right has created 
and used its own ideologically exclusive media 
ecosystem and digital platforms even as it 
continues to engage with the best-known tech 
platforms and news outlets out of necessity. 
These developments in turn (along with other 
nondigital factors) fuel what scholars have 
called “asymmetric polarization,” the proposition 
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that conservatives have grown more extreme 
over the past few decades than liberals (14, 37, 47). 
Asymmetric polarization’s broader consequences 
include less common ground between opposing 
political sides, increasingly extreme policies 
when conservatives are elected, and more 
opportunities for ideologically branded mis- 
and disinformation to spread on the right, 
which we discuss further in the next section. 


Emerging research on 
asymmetric disinformation 


Since the 2016 U.S. and U.K. Brexit elections, 
scholars, the news media, and international 
publics have become increasingly 


Existing research provides numerous exam- 
ples of conservative-targeted disinformation, 
in which right-wing media ecosystems around 
the world are often centrally implicated 
(49, 56, 57). In the United States, the alt-right, 
unapologetic white nationalists, and others on 
the rightmost fringe attract relatively small 
audiences and must rely on media outlets at 
higher levels of the ecosystem to help circulate 
their disinformation and other extreme ideas 
broadly (74). The fringes are not always suc- 
cessful; in particular, conspiracy theories im- 
plicating a Washington, DC, pizza parlor as the 
center of a Democrat-controlled pedophilia 


of 2020 prominently includes right-wing media 
ecosystem members such as GatewayPundit 
(@gatewaypundit) and commentators for Fox 
News (@greggutfeld) and Infowars (@liberty- 
tarian) (58). In this way, the right-wing media 
ecosystem circulates sensationalistic content 
to an ideologically friendly audience free of the 
sorts of editorial practices that would prevent 
the spread of false information. The goal, as 
with much disinformation, is to support the in- 
group and denigrate the outgroup, even at the 

expense of verifiable truth. 
Perhaps because of the implications of the 
research reviewed above, very few studies 
have directly investigated online left- 


concerned with the problem of false 106,008 / wing disinformation or conspiracy 
and misleading political content 50 224,381 theories at scale. The studies show- 
(14, 22, 48, 49). This general pheno- * 45 ing a conservative-leaning asymmetry 
menon has multiple variants with  § 49 in social media false news sharing 
a variety of labels, including the ubiq- S 35 largely draw their data from before the 
uitous and ambiguous “fake news,” 2 2016 election (53, 54). If liberals have 
which we avoid. Here, we will focus = 30 changed in their susceptibility to dis- 
on disinformation, which we define & 25 information in the ensuing years, pos- 
as “all forms of false, inaccurate, or 5 20 sibly because of incentives introduced 
misleading information designed, pre- So 15 ee by strong anti-Trump animus, we do 
sented and promoted to intentionally 5 10 : — 1 aie not yet know. This could be a case of 
cause public harm or for profit” (48). pe 1 | 260 248 632/ failing to find that which is not sought. 
Unlike misinformation, which refers a 5 - : 54,932 The implications of such research are 
to misleading content spread inadver- = : highly relevant to democratic practice: 
tently, disseminators of disinformation Farleft Centerleft Center Centerright Far right For one, they will help us understand 


know their messages are deceitful. Actors 
behind such deceptive content seek to 
spread conspiracy theories, false rumors, 
hoaxes, and inflammatory opinions to 
promote their ideological viewpoints, 
decrease trust in mainstream insti- 
tutions, and recruit others to their 
causes (22). 

The relevant literature offers three 
types of evidence in support of the 
proposition that disinformation is 
more prevalent on the right than on 
the left, although to our knowledge this has 
not been directly tested. First, evidence from 
psychological studies indicates that conserva- 
tive individuals are more likely than liberals 
to prefer the kinds of closed media environ- 
ments (sometimes called “echo chambers”) 
that facilitate the spread of mis- and disinfor- 
mation (50), believe conspiracy theories when 
cued by official denials of conspiratorial causes 
(5D, and tolerate the spreading of disinforma- 
tion by politicians (52). Second, analyses of false 
news diffusion on social media have generally 
shown a tendency for conservatives to share 
such content more than liberals (53, 54). Third, 
the most visible mainstream news media out- 
lets, upon which the left relies much more 
heavily for political information than the right, 
have a long history of fact-checking norms 
that largely prevent disinformation from thriv- 
ing there (J4), which is why understanding 
how the news industry operates helps individ- 
uals avoid disinformation (55). 
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Fig. 2. Percentages of fragmented users retweeting media outlets 
across five ideological partitions. The denominator for each percentage is 
the number of users who retweeted (shared content from) at least one 
media outlet in that partition, whereas the numerator is the number of users 
for whom at least 80% of their retweets were of outlets in that partition 
(ie., “fragmented” users). This figure depicts the behavior of the 1.82 million 
unique Twitter users in the dataset who retweeted three or more media 
accounts. The dataset upon which this figure is based comes from (37) and 
contains >88 million tweets about six major news issues throughout 2017. 
Media outlets and corresponding ideological classifications come from 
(14). See (58) for the data and code used in creating this figure. 


ring and accusing a left-wing activist of mur- 
dering a counterprotester at the 2017 Unite the 
Right rally were not endorsed by the ecosys- 
tem’s upper echelons (/4, 38). The ranks of 
disinformation stories that achieved greater 
notoriety include the Seth Rich conspiracy, 
in which a Hillary Clinton staffer was allegedly 
murdered because of what he knew about her 
emails. (Rich was killed in Washington, DC, 
on 10 July 2016 by unknown assailants, but no 
credible evidence links his death to Clinton.) 
The story originated among fringe ecosystem 
users on Twitter and Reddit in the weeks after 
Rich’s death (/4). Sean Hannity covered the 
conspiracy multiple times in 2017 on his 
eponymous Fox News program, although the 
network eventually retracted the story. More 
recently, our analysis shows that the top ranks of 
the Twitter network discussing the debunked 
2020 documentary Plandemic (which makes 
unsubstantiated and scientifically unsound 
allegations about COVID-19) in April and May 
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the extent of the problem, who is most 
acutely affected, and under what con- 
ditions. Understanding the ideological 
and psychological antecedents of dis- 
information susceptibility is an impor- 
tant first step in targeting interventions 
to counteract it. To the extent that we 
as citizens value a democracy free of 
fraudulent attempts at opinion manip- 
ulation, we should investigate all con- 
texts in which it might lurk. 

Two existing studies, along with our 
own analysis of recent Twitter data, offer some 
evidence that left-leaning disinformation may 
not be as rare as the literature suggests. First, 
research published by Buzzfeed in October 2016 
found that although conservative Facebook 
pages posted nearly double the proportion of 
false or partly false content as liberal pages, 
such content garnered much higher median 
shares per post on left-wing pages than on 
right-wing ones (59). (We should note that 
this report only analyzed six Facebook pages 
in total, its data were not made public, and it is 
possible that false content on right-leaning 
pages accrued more shares in total given that 
there was more of it.) Second, a recent study 
found that tweets posted by Russian disinfor- 
mation agents masquerading as left-wing 
African American activists attracted more 
attention on a per-tweet basis than either 
those by conservative identities or non-Black 
left-leaning identities (60). This demonstrates 
a level of vulnerability to disinformation on 
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the left that is not often acknowledged. Third, 
we find that tweets mentioning the key words 
“anonymous” and “trump” posted between 
31 May 2020 (when the Anonymous hacktivist 
collective released a cache of documents pur- 
porting to prove, among other accusations, 
that Donald Trump was involved in child sex 
trafficking) and 2 June 2020 were retweeted 
>1.1 million times, more than double the total 
retweet count for Plandemic in our analysis 
above (58). In contrast to the Plandemic net- 
work, the most-retweeted users on this topic 
are overwhelmingly nonelites with few fol- 
lowers (except for @youranoncentral, which is 
ostensibly controlled by Anonymous), not well- 
known liberals or mainstream news outlets. 
We acknowledge that these findings are pre- 
liminary and raise pressing validity questions— 
many of the attention metrics boosting these 
stories could have been generated by bots, for 
example—but we include them here for lack 
of more rigorous research on the matter. 
Taken together, we believe that they suffice 
to justify further investigation into disinfor- 
mation aimed at the left. 


Conclusion and future research 


This review offers three main sets of conclu- 
sions. First, people participate in online activ- 
ism along a wide spectrum of commitment 
levels, from liking and sharing content, to 
the back-and-forth of political discussion, to 
involvement as core movement leaders. Low- 
cost online actions do not harm activist goals; 
on the contrary, they help to boost activist 
topics and concerns to the levels of public 
visibility necessary to enact or prevent change. 
Both the left and right benefit from this basic 
dynamic of online activism. However, there is 
still much to learn about how clicktivism op- 
erates; for example, we still do not know how 
frequently hashtag-based conversations or 
signal-boosting extreme perspectives change 
people’s minds or behaviors. Second, the left 
and right generally engage in two distinct 
styles of online outreach: hashtag activism 
and online advocacy spearheaded by the right- 
wing media ecosystem, respectively. The iso- 
lation of the far right from the rest of the 
ideological spectrum results in asymmetric 
polarization and complicates the process of 
governing ideologically diverse polities. Key 
areas for future research here include mea- 
suring the relative capacities of these two 
styles in reaching, persuading, mobilizing, and 
antagonizing elites and nonelites on both 
sides. Third, disinformation distribution ap- 
pears to be one of the key functions of right- 
wing media ecosystems. However, the marked 
lack of research on left-wing disinformation 
leaves many questions about how it operates, 
who is most at risk, and how serious a problem 
it is, making such research an urgent priority. 
The very limited number of studies on right- 
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wing online protest and activist hashtag use 
is similarly glaring. Moving forward, research- 
ers should endeavor to discover whether our 
current empirical understanding of left- and 
right-wing activism online represents reality 
faithfully or is a product of systematic gaps in 
case selection. 
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Lunar hematite holds clues 


Edited by Michael Funk 


he poles of the: Moon may contain a 
record of billions of years of oxygen 
isotopes of Earth’s atmosphere. The 
lunar surface is highly reducing, yet 
asrecent study using data collected * 
by the Moon Mineralogy Mapper provides 
strong evidence for the presence of hema 
tite, acommon iron oxidation product 
on Earth, at high lunar latitudes. Li et al. 
mapped the hematite deposits, noting that 
they are mostly found in association with 
east equator—facing sides of topographic 
highs on the lunar nearside of the Moon. 
They propose that oxygen, delivered from 
Earth’s upper atmosphere, could produce 
a locally oxidizing environment, allowing 


hematite to form. —RK 


Sci. Adv. 10.1126/sciadv.abal940 (2020). 


Terrestrial 
biogeochemistry of silicon 


Silicon is an important element 
in plant tissues and contributes 
to structural defenses against 
herbivores and other stresses. 
However, the terrestrial bio- 
geochemical cycling of silicon 
is poorly understood, particu- 
larly the relative importance of 
geochemical and biological 
mechanisms in its regulation. 
de Tombeur et al. studied this 
question in 2-million-year 
chronosequences of soil and 
vegetation in Western Australia. 
Sites became progressively 
more weathered and infertile 

as they aged, indicating that 
the silicon cycle shifts from 
geochemical to biological control 
as the ecosystem develops (see 
the Perspective by Carey). They 
found that foliar silicon concen- 
trations increase continuously 


SCIENCE sciencemag.org 


during ecosystem develop- 
ment, even though rock-derived 
silicon is depleted in the older 
soils. By contrast, other major 
rock-derived nutrients showed 
decreasing concentrations in 
plants. Hence, biological silicon 
cycling allows plants to maintain 
concentrations even under con- 
ditions of extreme soil infertility. 
—AMS 

Science, this issue p. 1245; 

see also p. 1161 


Finding the start 

Eukaryotic translation involves 
many players in a dynamic and 
well-orchestrated process. A 
A3S preinitiation complex (PIC) 
comprises the 40S ribosomal 
subunit; initiation factors, includ- 
ing the elF3 complex, which is 
known to play a key role; and the 
transfer RNA used for translation 
initiation. The PIC is recruited to 


the cap-binding complex elF4F 
at the 5' end of messenger RNA 
(mRNA) to form a 48S complex 
that scans along the MRNA for 
a start codon. Brito Querido et 
al. determined the structure 
of areconstituted human 48S 
complex using cryo—electron 
microscopy. They found that 
elFAF binds to elF3 near the 
exit site of the ribosome. This 
positioning suggests that down- 
stream mRNA is likely pulled 
through the 40S subunit to find 
the start codon. —VV 

Science, this issue p. 1220 


Aviral block on host 
protein synthesis 


As the coronavirus disease 2019 
(COVID-19) pandemic continues 
to cause devastation, scientists 
race to increase their under- 
standing of the disease-causing 
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Remote observation 
reveals the mineral 
hematite on steep walls 
of lunar craters. 


severe acute respiratory syn- 
drome coronavirus 2. Once inside 
host cells, not only does the 
virus hijack the cells’ transla- 
tional machinery to make viral 
proteins, but the virulence factor 
nonstructural protein 1 (Nsp1) 
also shuts down translation of 
host messenger RNA. Thoms et 
al. determined a 2.6-angstrom 
resolution cryo—electron micros- 
copy structure of a reconstituted 
complex of Nsp1 bound to the 
human 40S ribosomal subunit 
and showed that Nsp1 blocks the 
messenger RNA entry tunnel. 
A structural inventory of native 
Nsp1-ribosome complexes 
from human cells confirms this 
mechanism. Cellular studies 
show that the translational shut- 
down almost completely inhibits 
the innate immune response. The 
binding pocket on the ribosome 
may be a target for drugs to treat 
COVID-19. —-VV 

Science, this issue p. 1249 
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Accounting for heat | 
burdens 


As climate warming becomes 
more and more apparent and 
influential, there is an increasing 
desire to predict its long-term 
impacts on species. Classically, 
this has been done by extrapo- 
ating lethal limits based on 
those observed in the laboratory. 
In the real world, however, organ- 
isms do not experience a single 
high temperature that then 
returns to a comfortable temper- 
ature, but rather a series of high 
temperatures during the hot 
season. Rezende et al. accounted 
for these accumulative effects in 
a dynamic model that accurately 
predicted empirical patterns in 
wild fruit fly populations, show- 
ing that cumulative effects of 
warming temperatures can be 
included in future estimates (see 
the Perspective by Huey and 
Kearney). —SNV 

Science, this issue p. 1242; 

see also p. 1163 


Ripping up a 
circumstellar disk 

During the process of star 
formation, a disk of gas and 
dust forms around the young 
star, controlling the accretion 
of more material. Once the star 
has formed, any leftover mate- 
rial in this circumstellar disk 
can form planets. If a binary or 


triple star forms at the center 
of the disk, theoretical mod- 
els predict that tidal torques 
caused by their orbits can rip 
the disk apart, in a process 
known as disk tearing. Kraus et 
al. observed the nearby young 
triple-star system GW Orionis 
with multiple near-infrared and 
submillimeter telescopes, using 
the techniques of interferome- 
try and polarimetry. They found 
evidence for multiple rings 
with different orientations and 
warping of part of the disk, both 
produced by disk tearing. —KTS 
Science, this issue p. 1233 


Opioid signaling rewards 
steroid abuse 


The dopamine system medi- 
ates feelings of reward and 
is implicated in addiction to 
various drugs, including anabolic 
androgenic steroids. Bontempi 
and Bonci found that the neu- 
rological and behavioral effects 
of androgenic steroids were 
indirectly mediated by opioid 
receptors on dopaminergic 
neurons. A single injection of 
anabolic steroids in mice stimu- 
lated the release of B-endorphins 
in a dopaminergic neuron-rich 
brain region, which activated 
u-opioid receptor signaling. 
Blocking p-opioid receptor acti- 
vation prevented drug-seeking 
behavior in steroid-injected 
mice. —LKF 

Sci. Signal. 13, eabal169 (2020). 


Computed image of a warped circumstellar disk in a triple-star system 
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Bending bimetallic 
nanowhiskers 


Metal nanocrystal wires can be 
very strong, because deforma- 
tion requires nucleation of new 
dislocations; thus, bending 
them by mechanical loading 

is difficult to control and can 
even break the nanowires. Qi et 
al. used molecular beam epi- 
taxy to grow single-crystalline 
<011>-oriented gold nanowires 
(~360 nanometers in diameter 
and several micrometers long) 
and then coated one side with 
iron layers either ~50 or ~200 
nanometers thick. Heating 
these bimetallic nanowhis- 
kers (up to 500°C) gradually 
induced irreversible bending. 
Diffusion of iron into the gold 
nanowhiskers caused bending 
through the change in lattice 
parameter of the alloy as well 
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as a volume effect caused 

by the interphase boundary 

migrating toward the iron layer. 

—PDS 

ACS Nano 10.1021/acsnano.0c04327 
(2020). 


Heart scars 

The healing process often 
involves scar formation. 
However, scars display lower 
elasticity than normal tissue 
and can interfere with tissue 
function. Such problems arise 
after myocardial infarction, with 
increasing scar size correlating 
with higher mortality. Yokota et 
al. examined gene expression 
after myocardial infarction in 
mice and found that collagen V 
regulates scar formation. If 
collagen V is absent, increased 
myofibroblast formation and 
expression of extracellular 
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CONSERVATION 
Air pollution is 
bad for bees 


ndia is the second most popu- 
lous country and has 9 out of 
the 10 most polluted cities in 
the world. Pollution there, as 
everywhere, is known to harm 
human health and well-being, 
but little work has character- 
ized its direct effects in nature. 
Thimmegowda et al. looked at 
the impact of air pollution on 
the giant Asian honey bee (Apis 
dorsata) in and around Bangalore 
and found that there was a direct 
correlation between pollution 
levels and differences in flower 
visitation behavior, heart rate, 
hemocyte levels, and ultimately 
survival. Results from experimen- 
tal exposure of laboratory-reared 
Drosophila at the same sites 
where bees were collected sup- 
ported the conclusion that these 
changes were caused by air pollu- 
tion. These results have important 
implications for pollution harming 
ecosystem services, as well as 
humans. —SNV 


Proc. Natl. Acad. Sci. U.S.A. 117, 20653 
(2020). 


matrix genes results in a larger 
scar size that affects the biome- 
chanical property of the tissue. 
The mechanosensitive feedback 
can be rescued by inhibition of 
specific integrins, indicating a 
way to treat injuries and prevent 
subsequent tissue dysfunction. 
—BAP 


Cell 182, 545 (2020). 


MATERIALS SCIENCE 
Metallic glass fibers 


Some metallic alloys can be 
rapidly cooled into a glassy 
state, which can improve their 
corrosion and wear-resistance 
properties while maintaining 
conductivity. However, such 
rapid cooling usually limits 

the size and shape they can 
assume, and many of the pro- 
cessing methods aren't feasible 
without losing the glassy state. 
Yan et al. fabricated meters 


SCIENCE sciencemag.org 


of uniform fibers with widths 
ranging from 40 nanome- 
ters to a few micrometers by 
coating metallic glass with a 
polymer with similar rheologi- 
cal properties. They used the 
extracted fibers as electrodes 
in fabrics and combined them 
with biocompatible polymers 
that can be used to electrically 
stimulate and record the activ- 
ity of neurons. —MSL 
Nat. Nano 10.1038/ 
s41565-020-0747-9 (2020). 


WORKFORCE DIVERSITY 
Arubric for keeping on 


(tenure) track 

Diversifying faculty begins with 
addressing biases in hiring and 
providing equity and account- 
ability in the training and 
mentoring of future faculty. 
Clement et al. interviewed 


current faculty at 20 U.S. 
institutions and identified 

14 qualifications and levels 

of achievement required for 
obtaining a faculty position 

in the life sciences, ultimately 
developing the validated 
Academic Career Readiness 
Assessment (ACRA) rubric. 
ACRA can be implemented 

in several ways: to provide 
formative feedback and iden- 
tify training opportunities for 
future faculty, to provide trans- 
parency and standardization 
for evaluating faculty candi- 
dates for departments, and to 
evaluate training programs for 
funding agencies. Not limited 
to the life sciences, ACRA is a 
blueprint for the characteriza- 
tion and evaluation of career 
readiness across disciplines, 
career types, and education 
levels. —MMc 


CBE Life Sci. Educ. 19, ar22 (2020). 


VIROLOGY 
Strategies to stay 


or to leave 


Virus RNA within a cell is 
either packaged into prog- 
eny viral particles for release 
(“leave strategy”) or serves 
as a template for replication 
(“stay strategy”). To test the 
implications of leave versus 
stay strategies for transmis- 
sion and replication efficiency, 
Iwanami et al. selected 
hepatitis C virus (HCV) strains 
with differing virion release 
characteristics. A series of 
culture experiments showed 
that a strain isolated from a 
fulminant hepatitis patient 
that transmits efficiently 
packages a higher proportion 
of RNA into particles (leave 
strategy). Conversely, a virus 
strain engineered to maximize 
productivity favors retention of 
RNA (stay strategy). A model 
constructed using data from 
time courses of viral produc- 
tion, infectivity of progeny, 
and infected cell numbers 
provides insights into viral 
decay dynamics and possible 
mechanisms of action of anti- 
HCV drugs. —CA 


PLOS Biol. 18, e3000562 (2020). 
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PLANT SCIENCE 
Growing straight 
and strong 


Wood is made of fibrils spun 
out of cellulose, which are 
major load-bearing polymers. 
The cellulose synthase com- 
plexes, embedded in the cell 
membrane, are aligned with 
intracellular microtubules that 
lie just inside the plant cell 
membrane. Those connections 
are enabled by a protein called 
CSI1. Bunder et al. found that 
aspen trees with less than 
the normal complement of 
CSI1 had twisted leaves, leaf 
epidermal (pavement) cells 
that lacked their normally 
refined puzzle shapes, and 
wood that was brittle and 
mechanically weak. The cause 
seemed to be less cellulose 
polymerization than normal, 
which undermined the sturdy, 
fined-grained structure of 
aspen wood. —PJH 

Plant J. 10.1111/tpj.14873 (2020). 


A two-month-old greenhouse- 
grown CSIIRNAi Populus tremula x 
tremuloides tree, which shows 
growth defects owing to a reduction 
in cellulose biosynthesis 
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CELL BIOLOGY 
Reconstituting 
autophagosome nucleation 


To stay healthy, our cells must 
constantly dispose of harm- 
ful material. Autophagy, or 
self-eating, is an important 
mechanism to ensure the clear- 
ance of bulky material. Such 
material is enwrapped by cellular 
membranes to form autophago- 
somes, the contents of which are 
then degraded. The formation of 
autophagosomes is a compli- 
cated process involving a large 
number of factors. How they act 
together in this process is still 
enigmatic. Sawa-Makarska et al. 
recapitulated the initial steps of 
autophagosome formation using 
purified autophagy factors from 
yeast. This approach elucidated 
some of the organizational 
principles of the autophagy 
machinery during the assembly 
of autophagosomes. —SMH 
Science, this issue p. 1206 


PHYSIOLOGY 
Body clock resilience 


The body clock, or circadian 
rhythm, which couples activi- 
ties and homeostatic processes 
to daylight, is different in men 
and women. Evidence suggests 
that women have higher peaks 
of activity during earlier parts 
of the day than men, in-line with 
children, and that they are more 
resilient to shifts in daylight (as 
would occur when changing 
time zones or with shift work). 
In a Perspective, Anderson and 
FitzGerald discuss the possible 
mechanisms and implications 
of different circadian rhythms in 
men and women and how these 
may affect health. —GKA 
Science, this issue p. 1164 


1205-B 


REGENERATION 
Regulatory elements of 
fish regeneration 


Some animals regenerate exten- 
sively, whereas others, such as 
mammals, do not. The reason 
behind this difference is not 
clear. If the genetic mechanisms 
driving regeneration are evolu- 
tionarily conserved, the study of 
distantly related species that are 
subjected to different selective 
pressures could identify distin- 
guishing species-specific and 
conserved regeneration-respon- 
sive mechanisms. Zebrafish and 
the short-lived African killifish 
are separated by ~230 million 
years of evolutionary distance 
and, as such, provide a biological 
context to elucidate molecular 
mechanisms. Wang et al. identify 
both species-specific and evolu- 
tionarily conserved regeneration 
programs in these fish. They also 
provide evidence that elements 
of this program are subjected to 
evolutionary changes in verte- 
brate species with limited or no 
regenerative capacities. —BAP 
Science, this issue p. 1207 


NEUROSCIENCE 
Sleep and basal 
forebrain activity 


Different patterns of neural 
activity in the brain control the 
sleep-wake cycle. However, how 
this activity contributes to sleep 
homeostasis remains largely 
unknown. Adenosine in the 
basal forebrain is a prominent 
physiological mediator of sleep 
homeostasis. Using a newly 
developed indicator, Peng et al. 
monitored adenosine con- 
centration in the mouse basal 
forebrain. There was a clear 
correlation with wake state and 
REM sleep. Activity-dependent 
release of adenosine could also 
be elicited after optogenetic 
stimulation of basal forebrain 
glutamatergic, but not choliner- 
gic, neurons. These findings offer 
new insights into how neuronal 
activity during wakefulness 
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contributes to sleep pressure 
through the release of sleep- 
inducing factors. —PRS 

Science, this issue p. 1208 


CORONAVIRUS 
Immune profiling of 
COVID-19 patients 


Coronavirus disease 2019 
(COVID-19) has affected mil- 
lions of people globally, yet how 
the human immune system 
responds to and influences 
COVID-19 severity remains 
unclear. Mathew et al. present a 
comprehensive atlas of immune 
modulation associated with 
COVID-19. They performed high- 
dimensional flow cytometry of 
hospitalized COVID-19 patients 
and found three prominent 
and distinct immunotypes that 
are related to disease sever- 
ity and clinical parameters. 
Arunachalam et al. report a sys- 
tems biology approach to assess 
the immune system of COVID-19 
patients with mild-to-severe 
disease. These studies provide 
a compendium of immune cell 
information and roadmaps for 
potential therapeutic interven- 
tions. -PNK 

Science, this issue p. 1209, p. 1210 


PROTEIN DESIGN 
Anew tool in the protein 
design toolbox 


Protein design can compute pro- 
tein folds from first principles. 
However, designing new proteins 
that are functional remains 
challenging, in part because 
designing binding interactions 
requires simultaneous optimiza- 
tion of protein sequence and 
protein-ligand conformation. 
Polizzi and DeGrado designed 
proteins from scratch that bind 
a small-molecule drug (see the 
Perspective by Peacock). They 
introduced a new structural ele- 
ment called a van der Mer (vdM), 
which tracks the orientation 

of a chemical group relative to 
the backbone of a contacting 


Published by AAAS 


residue. Assuming proteins 
bind ligands using interactions 
similar to intraprotein packing, 
they determined statistically 
preferred vdMs from a large set 
of structures in the Protein Data 
Bank. By including weighted 
vdMs in their computations, 
they designed two of six de novo 
proteins that bind the drug 
apixaban. A drug-protein x-ray 
crystal structure confirmed the 
designed model. —VV 

Science, this issue p. 1227; 

see also p. 1166 


METROLOGY 
Avery precise ratio 


The value of the ratio of the 
masses of the proton and the 
electron has a bearing on the 
values of other physical con- 
stants. This ratio is known to a 
very high precision. Patra et al. 
improved this precision even 
further by measuring particular 
frequencies in the rovibrational 
spectrum of the hydrogen 
deuteride molecular ion (HD*) 
(see the Perspective by Hori). 
To reach this high precision, 
the researchers placed the HD* 
molecules in an ion trap and sur- 
rounded them by beryllium ions. 
The cold beryllium ions then 
helped cool the HD* molecules, 
making the HD* spectral lines 
narrow enough that the proton- 
electron mass ratio could be 
extracted by comparison with 
theoretical predictions. —JS 
Science, this issue p. 1238; 
see also p.1160 


CORONAVIRUS 
The spread of 
SARS-CoV-2 in Brazil 


Brazil has been hard-hit by 

the severe acute respiratory 
syndrome coronavirus 2 (SARS- 
CoV-2) pandemic. Candido et al. 
combined genomic and epidemi- 
ological analyses to investigate 
the impact of nonpharmaceuti- 
cal interventions (NPIs) in the 
country. By setting up a network 
of genomic laboratories using 
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harmonized protocols, the 
researchers found a 29% posi- 
tive rate for SARS-CoV-2 among 
collected samples. More than 
100 international introductions 
of SARS-CoV-2 into Brazil were 
identified, including three clades 
introduced from Europe that 
were already well established 
before the implementation of 
NPIs and travel bans. The virus 
spread from urban centers to the 
rest of the country, along with a 
25% increase in the average dis- 
tance traveled by air passengers 
before travel bans, despite an 
overall drop in short-haul travel. 
Unfortunately, the evidence 
confirms that current interven- 
tions remain insufficient to keep 
virus transmission under control 
in Brazil. —CA 

Science, this issue p. 1255 


CORONAVIRUS 
Adecoy receptor for 
SARS-CoV-2 


For severe acute respiratory 
syndrome coronavirus 2 (SARS- 
CoV-2) to enter human cells, the 
spike protein on the surface of 
the virus must bind to the host 
receptor protein, angiotensin- 
converting enzyme 2 (ACE2). A 
soluble version of the receptor is 
being explored as a therapeutic. 
Chan et al. used deep mutagene- 
sis to identify ACE2 mutants that 
bind more tightly to the spike 
protein and combined muta- 
tions to further increase binding 
affinity (See the Perspective by 
DeKosky). A promising variant 
was engineered to be a stable 
dimer that has a binding affin- 
ity for the spike protein; it is 
comparable with neutralizing 
antibodies and neutralized both 
SARS-CoV-2 and SARS-CoV-1 in 
a cell-based assay. In addition, 
the similarity to the natural 
receptor may limit the possibility 
for viral escape. —VV 

Science, this issue p. 1261; 

see also p. 1167 
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NEURODEGENERATION 
Unraveling protein 
clumping 

Repeat expansion in the C9orf72 
gene causes amyotrophic lateral 
sclerosis (ALS) and fronto- 
temporal dementia (FTD), two 
neurodegenerative disorders 
with common features. A 
proportion of patients with ALS 
or FTD present cytoplasmatic 
aggregates of a protein called 
TDP-43 in the brain. The mecha- 
nisms mediating the formation 
of TDP-43 aggregates are 
unclear. Cook et al. now show 
that a poly-glycine-arginine 
protein (poly-GR) produced by 
the repeat expansion enhanced 
the formation of TDP-43 aggre- 
gates in vitro and in vivo in mice 
by altering nucleocytoplasmic 
transport. Targeting the repeat 
expansion with a specific anti- 
sense oligonucleotide reduced 
the formation of TDP-43 aggre- 
gates. The results illuminate 

the mechanisms mediating the 
formation of toxic aggregates 

in neurodegenerative diseases. 
—MM 

Sci. Transl. Med. 12, eabb3774 (2020). 


WOUND HEALING 
A pain sensor promotes 
regeneration 


Wound healing in mammalian 
skin often results in fibrotic 
scars, and the mechanisms by 
which original nonfibrotic tissue 
architecture can be restored 
are not well understood. Wei et 
al. found that pharmacological 
activation of the pain sen- 

sor TRPAI, which is found on 
cutaneous sensory neurons, 
can limit scar formation and 
promote tissue regeneration. 
They confirmed the efficacy 

of TRPA1 activation in three 
different skin-wounding mouse 
models and observed that local- 
ized activation could generate 
a response at distal wound 
sites. TRPA] activation induced 
interleukin-23 production by 
dermal dendritic cells, which 
activated interleukin-17—pro- 
ducing y8 T cells and promoted 
tissue regeneration. These 
findings provide insight into 


neuroimmune signaling path- 
ways in the skin that are critical 
to mammalian tissue regenera- 
tion. —CNF 

Sci. lmmunol. 5, eaba5683 (2020). 
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CELL BIOLOGY 


Reconstitution of autophagosome nucleation defines 
Atg9 vesicles as seeds for membrane formation 


Justyna Sawa-Makarska*+}, Verena Baumann’, Nicolas Coudevylle*, Séren von Biilow, 
Veronika Nogellova, Christine Abert, Martina Schuschnig, Martin Graef, 


Gerhard Hummer, Sascha Martenst+ 


INTRODUCTION: Macroautophagy (hereafter 
autophagy) is an evolutionarily conserved 
lysosomal degradation pathway. It ensures 
cellular homeostasis and health by removing 
harmful material from the cytoplasm. Among 
the many substances that are degraded by 
autophagy are protein aggregates, damaged 
organelles, and pathogens. Defects in this path- 
way can result in diseases such as cancer and 
neurodegeneration. During autophagy, the 
harmful material, referred to as cargo, is se- 
questered by double-membrane vesicles called 
autophagosomes, which form de novo around 
the cargo. Autophagosome formation occurs 
at sites close to the endoplasmic reticulum 
(ER). The process is catalyzed by a complex 
machinery that includes protein and lipid ki- 
nases, membrane binding and transfer pro- 
teins, and ubiquitin-like conjugation systems. 
How these components and biochemical activ- 


ities act in concert to mediate autophagosome 
formation is incompletely understood. Partic- 
ularly enigmatic are autophagy related protein 
9 (Atg9)-containing vesicles that are required 
for the assembly of the autophagy machinery 
but do not supply the bulk of the autophago- 
somal membrane. 


RATIONALE: To understand the mechanism of 
how the various biochemical activities of the 
autophagy machinery are orchestrated during 
the nucleation and expansion of the precur- 
sors to autophagosomes at the cargo, we fully 
reconstituted these events using the yeast 
machinery. Specifically, we used recombinant- 
ly expressed and purified proteins in combi- 
nation with reconstituted Atg9 proteoliposomes 
and endogenous Atg9 vesicles isolated from 
cells. Our reconstituted system included 
21 polypeptides, as well as membrane platforms, 


Atg8 conjugation 


Assembly of the yeast autophagy machinery. Model for the assembly of the yeast autophagy machinery 
and Atg2-mediated lipid transfer into Atg9 vesicles from a donor compartment, such as the endoplasmic 


reticulum, during the nucleation of autophagosomes. 


Sawa-Makarska et al., Science 369, 1206 (2020) 
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making up almost the entire yeast core ma- 
chinery required for selective autophagy. This 
approach allowed us to exert full control over 
the biochemical reactions and to define the 
organization principles of the early autoph- 
agy machinery. 


RESULTS: We found that Atg9 vesicles and 
proteoliposomes were recruited to the au- 
tophagy cargo via the Atg19 receptor and Atgl1 
scaffold axis. The vesicles in turn recruited 
the Atg2-Atg18 lipid transfer complex and 
the class III phosphatidylinositol 3-phosphate 
kinase complex 1(PI3KC3-C1), which produced 
the signaling lipid phosphatidylinositol 3- 
phosphate (PI3P). PI3P production triggered 
the subsequent recruitment of the PI3P- 
binding protein Atg21, which together with 
the Atg2-Atg18 complex efficiently attracted 
the E3-like Atg12—Atg5-Atg16 complex. Togeth- 
er with the El-like Atg7 and the E2-like Atg3 
proteins, the recruitment of the E3-like com- 
plex ultimately resulted in the conjugation of 
the ubiquitin-like Atg8 protein to the head- 
group of phosphatidylethanolamine (PE) on 
the Atg9 vesicles and proteoliposomes. Atg8 
conjugation is a hallmark of autophagy and 
necessary for membrane expansion. Further- 
more, we discovered that sustained Atg8 con- 
jugation required the Atg2-mediated transfer 
of PE from a donor membrane into Atg9 
proteoliposomes. 


CONCLUSION: We conclude that Atg9 vesicles 
form seeds that establish membrane contact 
sites to initiate the transfer of lipids from 
donor compartments such as the ER. It has 
become increasingly clear that lipid transport 
between different compartments occurs at 
membrane contact sites and that it is medi- 
ated by lipid transfer proteins. Notably, lipid 
transfer at membrane contact sites requires 
two preexisting compartments. We propose 
that during the de novo formation of auto- 
phagosomes, the Atg9 vesicles recruit the 
autophagy machinery and serve as nucleators 
to establish membrane contact sites with a 
donor compartment such as the ER. Atg2- 
mediated lipid transfer in conjunction with 
energy-consuming reactions such as PI3K- 
dependent PI3P production and Atgs lipida- 
tion on the Atg9 vesicles drive net flow of 
lipids into the vesicles, resulting in their ex- 
pansion for autophagosome formation. ® 


The list of author affiliations is available in the full article online. 
*These authors contributed equally to this work. 
{Corresponding author. Email: justyna.sawa-makarska@ 
univie.ac.at (J.S.-M.); sascha.martens@univie.ac.at (S.M.) 
Cite this article as J. Sawa-Makarska et al., Science 369, 
eaaz7714 (2020). DOI: 10.1126/science.aaz7714 
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CELL BIOLOGY 


Reconstitution of autophagosome nucleation defines 
Atg9 vesicles as seeds for membrane formation 


Justyna Sawa-Makarska‘*}, Verena Baumann, Nicolas Coudevylle’*, Séren von Billow2, 
Veronika Nogellova’, Christine Abert’, Martina Schuschnig’, Martin Graef*“, 


Gerhard Hummer®, Sascha Martens, 


Autophagosomes form de novo in a manner that is incompletely understood. Particularly enigmatic 
are autophagy-related protein 9 (Atg9)—containing vesicles that are required for autophagy machinery 
assembly but do not supply the bulk of the autophagosomal membrane. In this study, we reconstituted 
autophagosome nucleation using recombinant components from yeast. We found that Atg9 proteoliposomes 
first recruited the phosphatidylinositol 3-phosphate kinase complex, followed by Atg21, the Atg2-Atg18 lipid 
transfer complex, and the E3-like Atg12—Atg5-Atg16 complex, which promoted Atg8 lipidation. Furthermore, 
we found that Atg2 could transfer lipids for Atg8 lipidation. In selective autophagy, these reactions could 
potentially be coupled to the cargo via the Atg19-Atg11-Atg9 interactions. We thus propose that Atg9 vesicles 
form seeds that establish membrane contact sites to initiate lipid transfer from compartments such as the 


endoplasmic reticulum. 


utophagy mediates the degradation of 

cytoplasmic material (the cargo) within 

lysosomes and ensures cellular homeo- 

stasis (1). Defects in autophagy have been 

associated with severe pathologies such 
as neurodegeneration, cancer, and infections 
(2). Cargo degradation is achieved by its se- 
questration within double-membrane vesicles 
called autophagosomes. These form de novo 
in an inducible manner and first appear as 
small membrane structures called isolation 
membranes (or phagophores), which gradu- 
ally enclose the cargo as they grow. The assembly 
and growth of the isolation membranes is de- 
pendent on a number of conserved autophagy- 
related (Atg) proteins that act together in a 
hierarchical manner to nucleate and expand 
the isolation membranes (3-5). In yeast, these 
include the Atgl protein kinase complex, ves- 
icles containing the Atg9 protein, the class III 
phosphatidylinositol 3-phosphate kinase com- 
plex 1 (PI3KC3-C1) producing the signaling 
lipid phosphatidylinositol 3-phosphate (PI3P), 
the PI3P-binding PROPPIN proteins, the lipid 
transfer protein Atg2, and the ubiquitin-like 
Atgi2 and Atg8 conjugation systems (Fig. 1A). 
During selective autophagy, the interaction of 
cargo receptors with scaffold proteins directs 
this machinery toward specific cargos (6, 7). The 
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attachment of Atg8 to the membrane lipid 
phosphatidylethanolamine (PE), referred to as 
lipidation, is the most downstream event of 
this cascade. How the biochemical activities of 
the autophagy machinery are orchestrated to 
mediate the formation of autophagosomes is 
not well understood. Especially enigmatic is 
the role of Golgi-derived Atg9 vesicles that are 
required for nucleation of the isolation mem- 
brane but that do not provide the bulk of the 
autophagosomal membrane (8-/7). The bulk 
of the lipids appears to be derived from other 
donor compartments, in particular the endo- 
plasmic reticulum (ER) (12-19). 

Previous work has demonstrated that mem- 
brane contact sites are major mediators of non- 
vesicular lipid flow between compartments 
within the cell (20, 21). The flow of lipids is 
mediated by lipid transfer proteins that ex- 
tract lipids from a donor membrane and trans- 
port them to an acceptor membrane. To 
elucidate how the various activities of the au- 
tophagy machinery act together during the 
nucleation of isolation membranes, we recon- 
stituted a large part of the yeast autophagy 
machinery in vitro. 


Membrane recruitment of Atg12-Atg5-Atg16 
by Atg21 and Atg2-Atg18 

A hallmark of isolation membranes and com- 
pleted autophagosomes is the conjugation of 
the ubiquitin-like Atg8 proteins to the head- 
group of the lipid PE (22, 23). The Atg8 pro- 
teins are required for isolation membrane 
expansion, closure, and cargo selectivity (24). 
The conjugation of Atg8 to PE is mediated by 
the El-like Atg7 and the E2-like Atg3 proteins 
(22) as well as the Atg12-Atg5-Atg16 complex 
that acts in an E3-like manner (25) by activat- 
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ing and localizing Atg8-loaded Atg3 to the 
membrane (26, 27). Thus, the localization of 
the Atg12-Atg5-Atg16 complex is a crucial de- 
terminant of the site of Atg8 lipidation (28). 
Atgi6 binds to the PI3P-binding PROPPIN 
protein Atg21 (29). We sought to determine 
whether this interaction could mediate the 
recruitment of the Atg12-Atg5-Atg16 complex 
to PI3P-containing membranes, such as the 
isolation membrane, and found that Atg21 
bound to PI3P-containing giant unilamellar 
vesicles (GUVs) (Fig. 1B). As expected (27), the 
Atgi2-Atg5-Atg16 complex did not directly 
bind to this lipid composition and was recruited 
only in the presence of Atg21 (Fig. 1B). In cells, 
the PI3P at the pre-autophagosomal structure 
(PAS) recruits another PROPPIN, the Atg18 
protein in complex with the membrane tether- 
ing and lipid transfer protein Atg2 (16, 30-33). 
We examined whether the Atg2-Atg18 com- 
plex could also interact with Atg12-Atg5-Atg16 
and thereby contribute to its recruitment to 
PI3P-positive membranes. Indeed, we detected 
a direct interaction between the two protein 
complexes in a pull-down assay (Fig. 1C). We 
also observed that the presence of Atg2-Atg18 
tended to accelerate the recruitment of the 
Atg12-Atg5-Atg16 complex to PI3P-containing 
GUVs (fig. SIA). Microscopy-based pull-down 
and membrane recruitment experiments indi- 
cated that, as expected, Atg21 bound to the 
Atg12-Atg5-Atg16 complex via Atg16 (fig. S1, 
B and C) (29), while the interaction of Atg2 
was mediated by Atg5 and the interaction of 
Atgi8 required the presence of Atg12 (Fig. 1, 
D to F, and fig. S1D). 

These results suggested the formation of a 
holocomplex on the membrane, containing 
Atg21, Atg12-Atg5-Atg16, and Atg2-Atg18, and 
so we dissected the recruitment of the individ- 
ual components in more detail. Atg21 was the 
main driving force for the recruitment of Atg12- 
Atg5-Atg16 under the conditions tested (Fig. 1G). 
In cells, both PROPPINS (Atg18 and Atg21) and 
Atg2 contributed to the localization of Atg12- 
Atg5-Atgi6 to the PAS (fig. S2) (29). The resi- 
dual recruitment of Atg12-Atg5-Atgi6 in the 
triple-deficient cells could be mediated by the 
Atgl complex (34). In addition, deletion of Atg2, 
Atg18, and Atg21 strongly reduced Atgs lipid- 
ation (fig. S3A), and deletion of any of the three 
proteins stalled the progression of the autoph- 
agic pathway (fig. $3, B and C) (29, 30). 

At the PAS, the PI3KC3-C1 [consisting of the 
vacuolar protein sorting 34 (Vps34), Vps15, 
Atg6, and Atg14: subunits] phosphorylates phos- 
phatidylinositol (PI) to PI3P (35). To address 
whether the recruitment of the Atg12-Atg5- 
Atgi6 complex and Atg8 lipidation could be 
driven by the activity of the PI3KC3-C1 through 
the PI3P-dependent recruitment of Atg2-Atg18 
and Atg21, we added the purified PI3KC3-C1 
to PI-containing GUVs in the presence of Atg21 
and Atg2-Atg18 (Fig. 2A). The Atg12-Atg5-Atg16 
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Fig. 1. Membrane recruitment of the Atg12-Atg5-Atg16 complex by 
PROPPINs. (A) Cartoon showing proteins used in this study. PISKC3-C1 

is labeled as PI3K in all figures. (B) GUVs containing PI3P (57% POPC, 25.5% 
POPS, 15% POPE, 2.5% PI3P; see table S2 for lipid definitions) were incubated 
with either 1 uM Atgl2—-Atg5-Atgl6-mCherry supplemented with 1 uM 
eGFP-Atg21, 1 uM eGFP-Atg21, or 1 uM Atgl2-Atg5-Atgl6-mCherry and 

imaged by microscopy. DIC, differential interference contrast microscopy. 
(C) GFP-Trap pulldown using Atg12—Atg5-Atg16-GFP or GFP as bait and Atg2-Atg18 as 


21 


mEGFP-Atg18 signal intensity (a. u.) 


E 
2500 
2000 
2000 
1500 | 
1500 
1000 
1000 
500 
500 
G Atg21 
Atg2-GFP-Atg18 Atg21 Atg2-GFP-Atg18 Atg21 
Atg12-5-16mCh Atg12-5-16mCh Atg12-5-16mCh Atg2-GFP-Atg18 


a 
Scale bar: 20 um 


and anti-CBP antibodies, respectively. (D to F) Quantification of the pull-down 
experiment mapping the interaction between Atg12—-Atg5-Atg16 and Atg2-Atg18 
shown in fig. SID. The quantification is based on three independent experiments. 
Standard deviations are shown. A schematic representation of the putative 
holocomplex composed of Atgl2—-Atg5-Atg16, Atg2-Atg18, and Atg21 is shown 
as a cartoon insert in (F). a.u., arbitrary units. (G) GUVs of the same lipid 
composition as in (B) were incubated with Atgl2—Atg5-Atg16-mCherry, Atg21, or 
Atg2-GFP-Atg18 at 1 uM final concentration each, and the recruitment of the 


prey. The bait and the prey proteins were detected by im 


complex was recruited to the GUV membrane, 
and this recruitment was dependent on the 
activity of the PI3KC3-Cl (Fig. 2A and fig. S4A). 
Atg21 alone was sufficient to recruit the Atg12- 
Atg5-Atg16 complex and to induce Atg§ lipida- 
tion on the GUVs (Fig. 2B). These effects were 


munoblotting with anti-GFP 


(Fig. 2B). We interpreted the localization of 
green fluorescent protein (GFP)-Atg8 on the 
membrane as lipidation because it was abol- 
ished when using a nonconjugatable form of 
Atg8 (GFP-Atg8-6xHis) and it strictly depended 
on the presence of the conjugation machinery 


enhanced when Atg2-Atg18 was also present 
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Atg7 and Atg3 (fig. S4B). 
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proteins to the membrane was imaged by microscopy. 


Reconstitution of Atg8 lipidation on 

Atg9 proteoliposomes 

Autophagosome nucleation depends on the pres- 
ence of Atg9 vesicles (8-17). In Saccharomyces 
cerevisiae, a few of these vesicles translocate 
to the autophagosome formation site (8). Be- 
cause Atg9 is required for the recruitment of 
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Fig. 2. In vitro reconstitution of PIS3KC3-C1—dependent Atg8 lipidation. 

(A) The Atg8-PE conjugation machinery (Atg7, Atg3, Atgl2—-Atg5-Atg16-mCherry, 
and GFP-Atg8AR117) and PROPPINs (Atg21 and Atg2-Atg18) were added to GUVs 
(55% DOPC, 10% DOPS, 17% DOPE, 18% liver PI) and incubated in the presence or 
absence of PI3KC3-C1 and cofactors (ATP, MnCle, MgCle, and EGTA). Microscopy 
images of representative GUVs are shown. The proteins included in the experiment 
are depicted in the cartoon inserts. (B) Atg8 lipidation to GUVs depends on the 
presence of Atg21. GUVs were incubated with Atg8—PE conjugation machinery 
proteins as in (A) and PI3KC3-C1 in the presence of either one or both PROPPINs. 
The quantification of the GFP signal on GUVs from three independent experiments is 
shown to the left. (€) Atgl2-Atg5-Atg16 recruitment to Atg9 PLs depends on the 
activity of PISKC3-Cl. GFP-Trap beads were coated with Atg9-EGFP PLs and 
incubated with Atg21, Atg2-Atg18, and Atgl2—-Atg5-Atgl6-mCherry in the presence 
or absence of PISKC3-C1 and ATP or in the presence of PISKC3-C1 and AMP-PNP. 


the PI3KC3-C1 to the site of autophagosome 
formation (36), we wondered whether the Atg9 
vesicles could serve as platforms for the as- 
sembly of the autophagy machinery and there- 
by nucleate autophagosome formation. To this 
end, we reconstituted the purified Atg9 pro- 
tein into small unilamellar vesicles (SUVs) to 
form proteoliposomes (PLs) (fig. S5, A to D). 
To mimic the natural lipid composition of 
these vesicles, we isolated Atg9 vesicles from 
S. cerevisiae and determined their lipid com- 
position by lipidomics (fig. S6A). The vesicles 
had a high PI content (44%) (fig. S6B) (37), 
suggesting that they should be particularly 
good substrates for the PI3KC3-C1. To test this, 
we tethered PLs containing Atg9-enhanced 
green fluorescent protein (EGFP) to GFP-Trap 
beads to image the recruitment of other fac- 
tors by microscopy. The membrane of the PLs 
was labeled by incorporation of a blue mem- 
brane dye (ATTO390-DOPE). Upon incubation 
of the vesicles with the PI3KC3-C1, Atg21, Atg2- 
Atgi8, and the Atg12—Atg5-Atg16 complex, we 
observed recruitment of Atg12-Atg5-Atg16 to 
the Atg9 PLs (Fig. 2C). Consistent with the re- 
sults above (Figs. 1G and 2B), recruitment was 
strongest in the presence of both Atg2-Atg18 
and Atg21 (Fig. 2D). We then added Atg7 and 
Atg3 to the reaction (now containing 14 poly- 
peptides) to test whether Atg8 could be conju- 
gated to the Atg9 PLs in a manner that depends 
on PI3KC3-Cl, Atg21, Atg2-Atg18, and Atg12- 
Atg5-Atgi6. We observed efficient Atg8 lipida- 
tion to the Atg9 PLs (Fig. 2E). Reduction of the 
Atg8 signal upon addition of the wild-type de- 
lipidating enzyme Atg4 but not its catalytic 
mutant (fig. S7A) showed that the detected 
mCherry-Atg8 signal at the beads was indeed 
attributable to lipidation. 

Analogous to the results we observed for 
Atg12-Atg5-Atg16 recruitment, Atg8 conjuga- 
tion was relatively independent of the Atg2- 
Atgi18 complex and was also weakly detectable 
in the absence of the PI3KC3-C1 (Fig. 2F and 
fig. S7, B and C), likely because Atg21 shows 
residual binding to PI-containing membranes. 
These results suggested a division of labor be- 
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tween Atg21 and Atg2-Atg18, where Atg21 plays 
a major role in the initial recruitment of Atg12- 
Atg5-Atg16, and the main function of Atg2- 
Atg18 could be membrane tethering and lipid 
transfer (16, 30-33). 


Reconstitution of autophagosome nucleation 
in selective autophagy 


In selective autophagy, autophagosome nucle- 
ation must be coupled to the presence of cargo 
material (7). The cargo is recognized by cargo 
receptors such as p62 in human cells and Atg19 
in S. cerevisiae. These cargo receptors link the 
autophagy machinery to the cargo via the 
FIP200/Atgi1 proteins (6). Atg11 was shown to 
interact with Atg9 (38, 39). We purified full- 
length Atgl1 and, in agreement with (40) but 
in contrast to (39), found Atgl1 to be a con- 
stitutive dimer (fig. SSB). Atg11 bound directly 
to the N terminus of Atg9 (fig. S8C). Next, we 
examined whether the Atgl19 cargo receptor 
could recruit the autophagy machinery, includ- 
ing Atg9 vesicles, to the cargo and subsequently 
initiate Atg8 conjugation. The cargo was mi- 
micked by attachment of the GST-prApel pro- 
peptide (residues 1 to 41) to glutathione beads. 
These were incubated with the Atg19 cargo 
receptor and subsequently with Atg11. Atg11 
was recruited to the beads in an Atg19-dependent 
manner. The recruitment was enhanced 
when a phospho-mimicking mutant of Atg19 
[Ser239;79°_, Asp (S390D, S391D, and S396D)] 
(41) was used (fig. S8A). Atg9 PLs and Atg9 
vesicles isolated from cells (fig. S9) bound to 
the cargo beads in an AtgI1-dependent manner 
(Fig. 3, A and B, and fig. S8D). When we added 
the PI3KC3-Cl, Atg2-Atg18, Atg21, Atg12-Atg5- 
Atgi6, Atg3, Atg7, and Atg8 to the Atg9 PLs 
bound to the cargo beads—a reaction now con- 
taining almost the entire autophagy machinery— 
Atg8 was efficiently lipidated and anchored to 
the Atg9 PLs (Fig. 3C). Isolated Atg9 vesicles 
could also serve as substrates for the lipidation 
reaction (Fig. 3D), although the lipidation was 
markedly less prominent on the vesicles than 
on the reconstituted PLs (fig. SIOA). The Atg8 
signal on the Atg9 vesicles was attributable to 
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Microscopy images of representative beads are shown. (D) Beads as in (C) were 
incubated with Atgl2—-Atg5-Atg]16-mCherry and PI3KC3-C1 in the presence of 
either one or both PROPPINs. The quantification of mCherry signal on beads from 
three independent experiments is shown to the left. (E) Reconstitution 

of Atg8 lipidation to Atg9 PLs. Beads as in (C) were incubated with PI3KC3-Cl, 
ATP, Atg21, Atg2-Atgl8, mCherry-Atg8ARI117, Atg7, Atg3, and Atgl2-Atg5-Atg16, 
each time omitting one of the Atg8—PE conjugation machinery proteins, as 
indicated above the microscopy images of representative beads. (F) Atg8 
lipidation to Atg9 PLs depends on the presence of Atg21. Beads as in (C) were 
incubated with PISKC3-C1, ATP, mCherry-Atg8AR117, Atg7, Atg3, and Atgl2- 
Atg5-Atgl6 in the presence of either one or both PROPPINs. The quantification 
of mCherry signal on the beads from three independent experiments is shown 
to the left. Significance is indicated using P values from Student's t test: *P < 0.05, 
**P < 0.01, ***P < 0.001. 


lipidation because it depended on the Atg12- 
Atg5-Atg16 complex (Fig. 3D) and decreased 
upon addition of Atg4 (Fig. 3E). Thus, the au- 
tophagy machinery can be redirected toward 
the cargo via the cargo receptor (Atg19)-scaffold 
(Atg11)-Atg9 axis (Fig. 3 and fig. S1OB). The 
Atg1-Atg13 complex was also recruited to these 
beads (fig. S10C). Thus, Atg11 and Atg9 vesicles 
are sufficient to recruit (almost) the entire 
autophagy machinery to the cargo. 


Atg9 vesicles as acceptors for lipid transfer by Atg2 
Owing to their small size, Atg9 vesicles provide 
only limited surface for Atg8 lipidation and 
isolation membrane expansion. Furthermore, 
in addition to Atg9, these vesicles contain 
other proteins, which further reduce the ef- 
fective surface for lipidation. This is consistent 
with our finding that Atg9 vesicles were less 
efficient substrates for Atg8 lipidation than 
Atg9 PLs (Fig. 3 and fig. SIOA). To estimate 
the available membrane surface of these ves- 
icles, we built a three-dimensional model of 
an Atg9 vesicle (Fig. 4A and movie S1). We 
based this model on an average diameter of 
60 nm (fig. S9) (8), our proteomics data (fig. 
S6C and data S1), and an average of 28 Atg9 
molecules per vesicle (8). In addition, we placed 
one molecule each of PI3KC3-Cl1, Atg2-Atg18, 
Atg21, Atg12—-Atg5-Atg16, and Atg3 loaded with 
Atg8 on the vesicular membrane (see Mate- 
rials and methods section for details). With 70 
proteins present in the modeled Atg9 vesicle, 
the accessibility of the membrane would be 
very limited. We calculated an effective dynam- 
ic surface coverage of 82% of the membrane 
area. Given that peripheral membrane pro- 
teins may have been lost during the isolation, 
the very stringent selection of proteins from 
mass spectrometric data used for modeling, 
and the fact that we assumed the Atg9 N and 
C termini not to interact with the vesicular 
membrane, the actual free surface may be even 
lower and more difficult to reach for incoming 
proteins. Thus, Atg9 vesicles may require lipid 
influx to transform into an efficient substrate 
for Atg8 lipidation. 
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the cargo-mimetic beads. Glutathione sepharose beads were coated with 
GST-prApel (1-41), Atg19-3D, and Atg11, incubated with Atg9-mCherry PLs (C) 
or Atg9-EGFP vesicles (D), washed with buffer, and incubated with PI3KC3-C1, 
ATP, Atg21, Atg2-Atg18, Atg3, Atgl12-Atg5-Atgl6, eGFP-Atg8AR117 (C) or 
mCherry-Atg8AR117 (D) and with or without Atg12-Atg5-Atg16 (see cartoons 
for the experimental setup). Microscopy images of representative beads are 
shown. (E) Time course experiment of the Atg8-deconjugation reaction on Atg9 
vesicles. Atg4 wild type or the Atg4 C147S inactive mutant were added to the 
beads, as in (D). Microscopy images were taken at the indicated time points 
after the addition of Atg4. 
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Fig. 4. Atg2-mediated lipid transfer into Atg9 PLs. (A) Molecular model of an _— from a wild-type yeast strain. (©) Coomassie-stained gels showing Atg8-PE 
endogenous Atg9 vesicle. The model contains the following proteins (copy conjugation assays using the depicted experimental setup. Atg8—PE conjugation 


numbers in parentheses): Atg9 (28), Atg27 (10), Atg23 (10), and the SNAP 
receptors (SNAREs) SFT1 (1), TLG1 (1), VTI1 (1), SSO1 (1), and GOS1 (1). Copy 
numbers are based on literature and mass spectrometry analysis of isolated 
Atg9 vesicles (see main text, fig. S6C, and methods section). Single copies of 
membrane-interacting autophagy proteins (PI3KC3-Cl, Atg21, Atg2-Atg18, Atg3, 
Atgl2—Atg5-Atg16, and Atg8) were additionally positioned on the surface of 

the Atg9 vesicle. Atg9-NTD and Atg9-CTD indicate N-terminal and C-terminal 
domains, respectively. Lipid headgroups are shown as small tan spheres. 

(B) Atg2-Atg18 is recruited to Atg9 vesicles and cargo-mimetic beads. GFP-Trap 
beads were coated with endogenous Atg9-EGFP vesicles. Glutathione sepharose 
beads were coated with GST-prApel (1-41), Atg19-3D, Atgl1, and Atg9-EGFP 
vesicles and incubated with Atg2-mCherry-Atg18. Mock membranes were derived 
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ng of NBD fluorescence. 
uorescence intensity at 


was detected as a band shift. Numbers above the gels i 
(D) Phospholipid transfer assay based on the dequenchi 
Fay and Fg) represent the nitrobenzoxadiazole (NBD) f 
each time point after and before addition of Atg2-Atg18, respectively, measured 
at 535 nm. Atg9 PLs were used as acceptor liposomes. Data are the mean values 
from five independent experiments. SD is shown. (E) Anti-Atg8 immunoblots 
showing Atg8-PE conjugation assays mediated by lipid transfer of Atg2-Atg18. 
The arrow indicates the Atg8 signal after pulling down Atg9-EGFP with GFP-Trap 
beads. [(C) and (E)] Quantification shows the averaged Atg8-PE/Atg8 ratio for 
each time point. Error bars represent SD. The quantification is based on four 
independent experiments. P values were calculated using Student's t test. 
Significance is indicated with *P < 0.05 and **P < 0.01 n.s., not significant. 
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vesicle 


Progression of autophagosome formation 


Fig. 5. Model for the initial steps of the isolation membrane generation. (A) Recruitment of Atg9 vesicles to the prApel cargo via the Atgl9 receptor and 
Atgll scaffold axis. The Atg9 vesicles recruit Atg2-Atgl8 and PI3KC3-C1 (labeled PI3K). Production of PI3P by PI3KC3-C1 recruits Atg21 and the E3-like Atgl2-Atg5- 
Atgl6 complex. The membrane-positioned E3-like complex directs Atg8—PE conjugation to the vesicle. Atg8 lipidation is sustained by Atg2-mediated lipid transfer 
from a donor compartment such as the ER. (B and C) Lipid influx expands the vesicle surface resulting in isolation membrane expansion. 


The lipid transfer protein Atg2 is recruited 
to the Atg9 vesicles (Fig. 4B) and tethers Atg9 
to the ER in cells (16). The interaction between 
ATG2A and ATG9QA is important for isolation 
membrane expansion in mammalian cells (42). 
Atg2-mediated lipid transfer from the ER into 
the membrane of the Atg9 vesicle may there- 
fore enable Atg8 lipidation and subsequent 
expansion of the spherical Atg9 vesicles, con- 
verting them into the disk-shaped isolation 
membranes. 

To test whether Atg2 can transport lipids for 
Atg8 conjugation, we mixed two populations 
of liposomes. One population (SUV A) con- 
tained a lipid composition that efficiently re- 
cruited the lipidation machinery (27) but did 
not contain PE as substrate for Atg8 conjuga- 
tion. The other population (SUV B) contained 
PE but was not efficiently targeted by the lipid- 
ation machinery (Fig. 4C and fig. S11C). Upon 
addition of Atg2-Atg18, which is active in lipid 
transport (fig. S11, A and B), we detected a sig- 
nificantly increased lipidation of Atg8, dem- 
onstrating that Atg2-Atg18 could directly enhance 
Atg8 lipidation (Fig. 4C). Because phosphati- 
dylserine (PS) can also serve as substrate for 
Atg8 lipidation in vitro (43), the actual stimu- 
latory effect of Atg2-Atg18 on Atg8 lipidation 
may be even higher. To exclude the possibility 
that Atg2-Atg18 allosterically activated the E3 
by direct binding, we conjugated Atg8 to PE- 
containing SUVs in the presence or absence of 
Atg2-Atg18 and found that we could not ob- 
serve significant differences in Atg8 lipidation 
(fig. S11D). Atg9 PLs also served as acceptors 
for Atg2-mediated lipid transport (Fig. 4D). 
We therefore sought to determine whether the 
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lipids transported into Atg9 PLs could serve as 
substrates for Atg8 lipidation. Atg9 PLs lack- 
ing PE and PS were mixed with a second pop- 
ulation of liposomes containing these lipids. 
We then added Atg2-Atgi8 in the presence of 
the PI3KC3-Cl, Atg21, Atg12-Atg5-Atg16, Atg7, 
Atg3, and Atg8 (Fig. 4E). We found that Atg8 
lipidation, as monitored by immunoblotting, 
was accelerated in the presence of Atg2-Atg18 
(Fig. 4E). To confirm that Atg8 lipidation oc- 
curred on the Atg9 PLs, we pulled down the 
Atg9 PLs using GFP-Trap beads and found 
lipidated Atg8 only in the presence of Atg2- 
Atgi8 (Fig. 4E, arrow in top immunoblot). 


Outlook 


Here, we present a near-full in vitro reconsti- 
tution of the events occurring during auto- 
phagosome nucleation in selective autophagy. 
Specifically, we demonstrate that Atg9 vesicles 
are substrates of PIZKC3-C1 and that the PI3P 
generated in situ mediates the successive re- 
cruitment of Atg21, Atg2-Atg18, and the Atg12- 
Atg5-Atg16 complex as prerequisites for the 
subsequent Atg§8 lipidation. 

The role of Atg9 vesicles has remained mys- 
terious. They are required for early steps of 
autophagosome formation but make up only a 
minor fraction of the lipids required to form 
the autophagosomal membrane (8-17). Auto- 
phagosomes are generated in proximity to the 
ER, but their membranes are clearly distinct 
from the ER membrane (13-19). Our results 
show that Atg9 vesicles form a platform for the 
recruitment of the autophagy machinery. Among 
them is the membrane tethering and lipid trans- 
fer protein Atg2 (16, 30-33), which can trans- 


4 September 2020 


fer lipids at a rate that enables it to be a major 
contributor to isolation membrane expansion 
(44). It has become clear that lipid transfer at 
membrane contact sites provides the commu- 
nication and membrane flow between intracel- 
lular compartments. However, lipid transfer 
can only occur between existing donor and ac- 
ceptor compartments. Atg9 vesicles may thus 
form seeds for the initial establishment of 
membrane contact sites. Therefore, quantita- 
tive Atg8 lipidation may only occur after lipid 
influx from the ER into the Atg9 vesicle, grad- 
ually converting it into the disk-shaped isola- 
tion membrane (Fig. 5). In this manner, Atg9 
vesicles could seed a biochemically distinctive 
membrane, the isolation membrane, largely 
devoid of transmembrane proteins (45, 46). 
To ensure the expansion of the isolation mem- 
brane, the incoming lipids must be distributed 
to its inner leaflet, an action that would re- 
quire flippase or scramblase activity. Notably, 
we found two flippases (Drs2 and Neol) pres- 
ent in our Atg9 vesicle proteomics analysis. 
Multiple individual nucleation events followed 
by ESCRT (endosomal sorting complexes re- 
quired for transport)-mediated membrane seal- 
ing may be required for the formation of larger 
autophagosomes (47-49). 

In addition, the Golgi-derived Atg9 vesicles 
isolated from cells might be tightly packed with 
proteins. The influx of loosely packed lipids 
from the ER might thus render them good sub- 
strates for subsequent Atg8 lipidation apart 
from the expansion of the free membrane area. 
In fact, autophagosomal membranes contain 
a high proportion of lipids with unsaturated 
fatty acids (12). Apart from serving as acceptors 
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for lipid influx, Atg9 vesicles may also kickstart 
local lipid synthesis (72). Accordingly, we found 
Faal and Faa4 in our Atg9 vesicle proteomics. 
During selective autophagy, cargo material 
is specifically sequestered by autophagosomes. 
It has become clear that cargo receptors act 
upstream of the autophagy machinery by re- 
cruiting scaffold proteins to the cargo (50-56). 
Here, we fully reconstitute the cargo receptor 
and scaffold dependent recruitment of the 
autophagy machinery to the cargo material 
and demonstrate that this system is sufficient 
to promote local Atg8 lipidation. Future work 
will reveal how the recruitment of the autoph- 
agy machinery, including the Atg9 vesicles, is 
sterically and temporally coupled to the forma- 
tion of membrane contact sites with the ER. 


Materials and methods summary 


The full version of the materials and methods 
is available in the supplementary materials. 


Protein expression and purification 


Atgi9 (residues 1 to 374) and the Atg19-3D and 
Atg19-3DALIR mutants were expressed and 
purified as described elsewhere (57, 58). mMEGFP/ 
mCherry-Atg8-AR117 was expressed and puri- 
fied as described in (27). 

6xHis-TEV-Atg21, 6xHis-TEV-mEGFP-Atg21, 
6xHis-TEV-mCherry-Atg21, 6xHis-Atg18-mEGFP, 
and Atg9-NTD(1-285)-mEGFP were all expressed 
in E. coli Rosetta pLyss. 

Atg2-Atg18-CBP (CBP, calmodulin binding pro- 
tein), Atg2-GFP-Atg18-CBP, and Atg2-mCherry- 
Atg18-CBP were purified from the SMY373, SMY374, 
and SMY439 yeast strains, respectively. 

6xHis-TEV-Atg2-mEGFP, PI3KC3-Cl, protA- 
TEV-Atgl1-Atg13, 6xHis-TEV-mEGFP/mCherry- 
Atgll, and 6xHis-TEV-Atg9- mEGFP/mCherry 
were all expressed in the baculovirus expres- 
sion system. 

All soluble proteins were purified via affi- 
nity chromatography followed by size-exclusion 
chromatography. 

For full length Atg9-mEGFP/mCherry, cell 
membranes were collected by centrifuging the 
cleared cell lysate at 40,000 revolutions per 
minute (rpm) for 1 hour. The membranes were 
resuspended for 2 hours at 4°C in lysis buffer 
containing 2% n-dodecyl B-D-maltoside (DDM). 
After 2 hours of incubation, the insoluble 
material was removed by centrifugation at 
40,000 rpm for 1 hour. Atg9 was then purified 
by affinity chromatography followed by size- 
exclusion chromatography in the presence 
of 0.2% DDM. To concentrate the protein 
without increasing the detergent concentra- 
tion, the fractions containing protein were in- 
cubated with 150 ul of nickel nitrilotriacetic 
acid (NiNTA) beads for 3 hours at 4°C. The 
beads were washed several times with 25 mM 
Tris pH 7.4, 300 mM NaCl, 0.04% DDM. The 
protein was eluted in the desired volume of 
buffer supplemented with 300 mM imidazole. 
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A final dialysis was performed overnight at 4°C 
against 25 mM Tris pH 7.4, 300 mM NaCl, 
0.04% DDM. 


Atg9 PLs formation and analysis 


Small unilamellar vesicles (SUVs; i.e., liposomes) 
destined for the reconstitution of Atg9 PLs 
were prepared with a lipid composition mimick- 
ing the lipid composition of the endogenous 
Atg9 vesicles determined in this study (for 
details, see table S2). For the incorporation 
of Atg9, the SUVs were treated with 3-[(3- 
cholamidopropyl)dimethylammonio]-1-propane- 
sulfonate (CHAPS) (Avanti Polar Lipids, Inc.). 
The SUV suspension was brought up to 2.5% 
CHAPS and incubated at room temperature 
(RT) for 1 hour. The SUV suspension was then 
mixed at a 1:1 ratio with a 1-uM Atg9 solution 
in 0.04% DDM. The mixture was incubated at 
RT for another 90 min and then diluted by a 
factor of 10 in Tris 25 mM Tris pH 7.4, 300 mM 
NaCl to reach a detergent concentration below 
the critical micelle concentration (CMC) of 
both detergents. The resulting PL solution was 
dialyzed overnight at 4°C against 25 mM Tris 
pH 7.4, 300 mM NaCl supplemented with 0.1 g 
of BioBeads SM2 (BioRad) per liter of buffer. 
Finally, BioBeads were added directly to the 
sample and incubated for 1 hour at RT. The 
insoluble material that did not get incorpo- 
rated into liposomes was removed by centri- 
fuging 30 min at 18,000 rpm. The supernatant 
containing Atg9 PLs was collected and used 
for subsequent experiments. 


Membrane recruitment—GUV assays 


To image Atg21, Atg2-Atg18, and Atg12-Atg5- 
Atgi6 membrane recruitment, 15 ul of the elec- 
troformed GUVs were transferred to a 96-well 
glass-bottom microplate (Greiner Bio-One), 
and the respective proteins were added to the 
final concentration of 1 uM in a final reaction 
volume of 30 pl in a reaction buffer 25 mM 
4-(2-hydroxyethyl)-1-piperazineethanesulfonic 
acid (HEPES) at pH 7.5, 150 mM NaCl. In 
every experiment involving GUVs, before the 
GUVs and proteins were pipetted onto the 
plate, the wells were blocked with a blocking 
solution [2.5 mg/ml bovine serum albumin 
(BSA) in 50 mM TrisHCl pH 7.4, 150 mM NaCl] 
for 1 hour and washed twice with the reac- 
tion buffer. 

For Atg21, Atg2-Atg18, and Atg12—Atg5-Atg16 
membrane recruitment in the presence of 
PI3KC3-Cl experiments, mixes containing respec- 
tive proteins, 0.1 mM adenosine triphosphate 
(ATP) or 0.1 mM adenylyl-imidodiphosphate 
(AMP-PNP), 0.5 mM MgClo, 2 mM MnCl, and 
1 mM egtazic acid (EGTA) in a final volume of 
15 ul were prepared. The final concentration of 
proteins in the reaction mixes were 50 nM for 
PI3KC3-Cl1, 400 nM for Atg21, 400 nM for 
Atg2-GFP-Atg18, and 40 nM for Atg12-Atg5- 
Atgi6-mCherry. The reaction mixes were ad- 


4 September 2020 


ded to the well already containing 15 ul of the 
electroformed GUVs. For the time course ex- 
periment, the imaging started 5 min after the 
addition of the reaction mix to GUVs. The 
images were acquired for 45 min at the indi- 
cated time points of reaction. 


In vitro reconstitution of Atg8 lipidation on GUVs 


To image the PI3KC3-Cl-dependent Atg8-PE 
conjugation to GUVs, mixes containing re- 
spective proteins (according to the experimen- 
tal setup), 0.5 mM ATP, 0.5 mM MgCl», 2 mM 
MnCl, and 1 mM EGTA in a final volume of 
15 ul were prepared. The reaction buffer con- 
tained 25 mM HEPES at pH 7.5, 150 mM NaCl. 
The final concentrations of proteins in the re- 
action mixes were 50 nM for PI3KC3-C1, 400 nM 
for Atg21, 400 nM for Atg2-Atg18, 40 nM for 
Atgi12-Atg5-Atg16-mCherry, 80 nM for Atg7, 
80 nM for Atg3, 400 nM GFP-Atg8AR117, and 
400 nM GFP-Atg8-6xHis. The reaction mixes 
were added to wells of a 96-well glass-bottom 
microplate (Greiner Bio-One) already contain- 
ing 15 wl of the electroformed GUVs. Concen- 
trations of proteins and cofactors used were 
calculated for the final 30 ul volume of the 
experiment. 


Microscopy-based protein-protein 
interaction assay 


For the experiments shown in Fig. 2, B and G, 
giant unilamellar vesicles (GUVs) were pre- 
pared. Preparation was carried out as described 
above. Assays were performed under equilibrium 
conditions, and mEGFP-Atg21, 6xHis-Atg21, 
Atgl2-Atg5-Atg16-mCherry, and Atg2-GFP-Atg18- 
CBP were added at a final concentration of 
500 nM. 

For Fig. 1, D to F, Atg12-Atg5-Atg16-mCherry, 
Atg5-mCherry-Atg16(1-46), and Atg16-mCherry 
were recruited to red fluorescent protein (RFP)- 
TRAP beads (Chromotek). Assays were per- 
formed under equilibrium conditions with 
2 uM of the prey proteins Atg2-GFP-Atg18-CBP, 
Atg2-mEGFP, and Atg18-mEGFP. 


Isolation of endogenous Atg9 vesicles 


To isolate endogenous Atg9 vesicles, we cloned 
versions of Atg9 tagged with a fluorophore 
(nEGFP or mCherry) and a tobacco etch virus 
(TEV) cleavable affinity tag (9xmyc or TAP). 
These constructs were used to replace the 
endogenous ATG9 gene in haploid BY474x 
S. cerevisiae cells, putting the expression under 
the control of the endogenous A7G9 promoter. 
Constructs were then integrated into wild type 
or pep4A strains. 

Strains were grown, harvested, and lysed. 
Cleared cell lysate was incubated with the ap- 
propriate affinity beads (coated with either 
immunoglobulin G or anti-myc antibody) at 
4°C for 1 hour. The beads were then washed, the 
vesicles were released by TEV cleavage at 4°C 
for an hour, and the supernatant was collected. 
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In vitro reconstitution of Atg8 lipidation on Atg9 
PLs or Atg9 vesicles bound to cargo-mimetic beads 
Assembly of the cargo-mimetic beads 


Glutathione sepharose 4B beads (GE Health- 
care) were first equilibrated in 25 mM Tris pH 
7.4, 300 mM NaCl. Beads were mixed with the 
same volume of a 30-1M solution of GST-prApel 
(1-41), 30-uM solution of Atg19-3DALIR mutant, 
and 30 uM of Atgll. The mixture was incubated 
for 1 hour at 4°C, and the beads were subse- 
quently washed three times. 


Recruitment of Atg9 PLs or Atg9 vesicles 
to the cargo-mimetic beads 


Ten microliters of cargo-mimetic beads were 
mixed with either 200 ul of Atg9-mCherry PLs 
solution or an equal volume of TEV-eluted 
Atg9-EGFP vesicles. The mixture was incu- 
bated for 2 hours at 4°C, and the beads were 
subsequently washed once 


In vitro Atg8 lipidation 


Five tenths of a microliter of cargo-mimetic 
beads coated with Atg9-mCherry PL or Atg9- 
EGFP vesicles were pipetted into the wells of 
a 384-well glass-bottom microplate (Greiner 
Bio-One) containing 0.5 mM ATP, 0.5 mM 
MgCly, 2mM MnClo, and 1 mM EGTA in a final 
volume of 15 ul. The final concentrations of 
proteins in the reaction mixes were 50 nM for 
PI3KC3-Cl1, 400 nM for Atg21, 400 nM for 
Atg2-Atg18, 40 nM for Atg12-Atg5-Atg16, 100 nM 
for Atg7, 100 nM for Atg3, and 400 nM for 
EGFP-Atg8AR117 (200 nM of mCherry-Atg8AR117 
for Atg9 vesicles). The reactions were incu- 
bated for 2 hours at RT in the dark, and the 
beads were imaged using confocal microscope 
LSM700 (Zeiss) with 20x objective and pro- 
cessed with ImageJ software. 

To deconjugate Atg8 from Atg9 vesicles, 
Atg4 or Atg4C147S was added at a final con- 
centration of 0.5 uM together with EDTA at a 
final concentration of 2 mM, and microscopy 
images were taken at the indicated time points. 
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INTRODUCTION: The ability to regenerate tissues 
lost to damage or disease is widely but nonuni- 
formly distributed in vertebrates. Some animals 
such as teleost fishes can regenerate a variety of 
organs, including amputated appendages, heart 
ventricles, and the spinal cord, whereas others 
such as mammals cannot. Even though regener- 
ation has been the subject of extensive phyloge- 
netic, developmental, cellular, and molecular 
studies, the mechanisms underlying the broad 
disparity of regenerative capacities in animals 
remain elusive. Changes in cis-regulatory ele- 
ments have been shown to be a major source 
of morphological diversity. Emerging evidence 
indicates that injury-dependent gene expres- 
sion may be controlled by injury-responsive 
enhancer elements. However, ablations of these 
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previously characterized elements from the 
zebrafish (Danio rerio) and Drosophila have 
shown that they are generally dispensable 
for regeneration. Therefore, whether conserved 
regeneration-responsive, rather than injury- 
responsive, elements exist in vertebrate genomes 
and how they evolved remain to be conclusively 
demonstrated. 


RATIONALE: Identification of conserved 
regeneration-responsive enhancers (RREs) 
requires two related but evolutionarily dis- 
tant species that are capable of regeneration. 
The dramatic differences in life history and 
the ~230 million years of evolutionary dis- 
tance between the zebrafish and the African 
killifish Nothobranchius furzeri provide a 
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RREs and vertebrate regeneration. Comparative H3K27ac ChiP-seq, bulk RNA-seq, and scRNA-seq of two distantly 
related teleost species (African killifish and zebrafish) during the early stages of regeneration helped to identify evolu- 
tionarily conserved RREs active in blastemal cells. Systematic transgenic reporter assays validated the putative RREs and 
helped to identify species-specific variations of an RRE essential for killifish regeneration. Our study provides a testable 
hypothesis based on enhancer repurposing to explain the uneven distribution of regenerative capacities in vertebrates. 
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unique biological context in which to distinguish 
between species-specific and conserved RREs. 
We reasoned that applying histone H3K27ac 
chromatin immunoprecipitation sequencing 
(ChIP-seq, a marker for active enhancers), bulk 
RNA sequencing (RNA-seq), and single-cell 
RNA-seq (SscRNA-seq) would identify RREs 
activated by amputation and help to determine 
their target gene expression at the single-cell 
level. Furthermore, we took advantage of the 
fast sexual maturation of African killifish to 
rapidly generate transgenic reporter assays to 
validate predicted RREs and to facilitate their 
functional testing in adult regeneration. 


RESULTS: We uncovered both large differences 
in the genomic responses to amputation in 
killifish and zebrafish and an evolutionarily 
conserved teleost regeneration response pro- 
gram (RRP), which is mainly deployed by 
regeneration-specific blastema cells. Bioinfor- 
matic analyses revealed that activation of the 
RRP, which includes known effectors of re- 
generation in zebrafish such as inhibin beta A 
(inhba), was differentially activated in mam- 
mals that are robust (Acomys cahirinus) and 
weak regenerators (Mus musculus). Functional 
testing by systematic transgenic reporter as- 
says of the conserved inhba RRE from killifish, 
zebrafish, and humans identified species-specific 
variations. Deletion of the killifish inkba RRE 
significantly perturbed caudal fin regeneration 
and abrogated cardiac regeneration. Further- 
more, inhba RRE activity required the presence 
of predicted binding motifs for the activator 
protein 1 (AP-1) complex. Lastly, AP-1-binding 
motifs can be identified in the conserved and 
nonconserved teleost RREs reported in this 
study, indicating that AP-1 may be required 
for both injury and regeneration responses. 


CONCLUSION: We propose an RRE-based model 
for the loss of regenerative capacities during evo- 
lution. In our model, the ancestral function for 
AP-1-enriched RREs was to activate a regen- 
erative response that included both injury and 
regeneration. Through the course of evolution 
and speciation, regeneration and injury responses 
became dissociated from each other in some but 
not all enhancers. In extant species, regeneration- 
competent animals maintain the ancestral en- 
hancer activities to activate both injury and 
regeneration responses, whereas in regeneration- 
incompetent animals, repurposing of ances- 
tral enhancers may have led to the retention of 
injury response activities but to the loss of the 
regeneration response. 
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Vertebrates vary in their ability to regenerate, and the genetic mechanisms underlying such disparity 
remain elusive. Comparative epigenomic profiling and single-cell sequencing of two related teleost 

fish uncovered species-specific and evolutionarily conserved genomic responses to regeneration. The 
conserved response revealed several regeneration-responsive enhancers (RREs), including an element 
upstream to inhibin beta A (inhba), a known effector of vertebrate regeneration. This element activated 
expression in regenerating transgenic fish, and its genomic deletion perturbed caudal fin regeneration 

and abrogated cardiac regeneration altogether. The enhancer is present in mammals, shares functionally 
essential activator protein 1 (AP-1)-binding motifs, and responds to injury, but it cannot rescue 
regeneration in fish. This work suggests that changes in AP-1-enriched RREs are likely a crucial source 


of loss of regenerative capacities in vertebrates. 


egeneration in response to tissue dam- 

age is not uniformly distributed in verte- 

brates (7). For instance, teleost fishes and 

salamanders can regenerate a variety 

of organs, including amputated append- 
ages, heart ventricle, and spinal cord, whereas 
mammals have relatively little regenerative 
capability (2, 3). Moreover, the ability to re- 
generate is generally limited to only early de- 
velopmental stages in certain species (4, 5). 
Changes in cis-regulatory elements or enhanc- 
ers are a major source of morphological diver- 
sity (6, 7). Emerging evidence suggests that 
the activation of injury-dependent gene ex- 
pression may be directed by injury-responsive 
enhancer elements (8). Two such elements, 
the leptin-b (lepb) enhancer in the zebrafish 
(Danio rerio) and the WNT gene cluster BRV1I8 
enhancer in the fruit fly Drosophila melano- 
gaster, modulate gene expression after injury. 
However, ablation of lepb in zebrafish or the 
fly WNT enhancer has shown these injury- 
responsive components to be generally dis- 
pensable for regeneration (8, 9). Therefore, 
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whether conserved regeneration-responsive, 
rather than injury-responsive, elements exist 
in vertebrate genomes and how they evolve 
have not been conclusively demonstrated. 
The identification of enhancers across spe- 
cies is complicated by the fact that these ele- 
ments change rapidly during evolution (70). A 
recent study showed that fin and limb re- 
generation share a deep evolutionary origin 
(11). Therefore, we hypothesized that if the 
genetic mechanisms driving regeneration are 
evolutionarily conserved in distantly related 
species subjected to different selective pres- 
sures, then it should be possible to distin- 
guish between species-specific and conserved 
regeneration-responsive enhancers (RREs). 
The vivid differences in life history and the 
~230 million years of evolutionary distance 
between the zebrafish and the African killi- 
fish Nothobranchius furgeri (fig. S1, A and 
B) provide an exclusive biological context in 
which to test this hypothesis. Both species 
can regenerate missing body parts after am- 
putation. However, whereas zebrafish are found 
in moderately flowing freshwater habitats in 
Southern Asia, killifish inhabit temporal ponds 
subjected to annual desiccation in the south- 
east of Africa (12). The strong selective pressure 
of seasonal desiccation has driven killifish 
to evolve interesting features, including rapid 
sexual maturation (as short as 2 weeks) (13), 
diapause embryos (/4), and an extremely short 
life span (4 to 6 months) (72). Here, we report 
that a systematic comparison of the epigenetic 
and transcriptional changes during the early 
stages of regeneration uncovered an evolution- 
arily conserved regeneration program. We also 
provide evidence that elements of this pro- 
gram are subjected to evolutionary changes 
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in vertebrate species with limited or no regen- 
erative capacities. 


Amputation-responsive enhancers 
evolved in teleosts 


Despite the drastic differences in fin shapes 
and lifestyles, the early morphology of regen- 
erating tail tissues in killifish and zebrafish 
appear indistinguishable from each other (Fig. 
1A and fig. S1, C and D). A tail blastema formed 
by 1 day postamputation (dpa) in both species, 
as indicated by the presence of E-cadherin- 
negative mesenchymal cells above the ampu- 
tation line (Fig. 1B). Blastema cells proliferated 
and expanded rapidly after 1 dpa in both killi- 
fish (fig. S2) and zebrafish (15). Because the 
cells driving the formation of a specialized re- 
generative blastema are recruited to the wound 
site at this stage, we chose this time point for 
comparison. 

Active enhancers and promoters are charac- 
terized by histone H3K27ac and H3K4me3 
marks (J6, 17). We assayed both killifish and 
zebrafish genomes (~1.5 gigabases) for H3K27ac 
and H3K4me3 enrichment using chromatin 
immunoprecipitation sequencing (ChIP-seq) 
in samples of uninjured (O dpa) and regener- 
ating (1 dpa) caudal fin. Our results revealed 
a marked difference in the total number of 
H3K27ac-marked putative RREs that did not 
overlap with promoter regions defined by 
H3K4me3 peaks at transcriptional start sites 
and available gene models between the two 
species: There were 1877 peaks (5% of total 
detected peaks) in killifish and 4162 peaks 
(7%) in zebrafish (Fig. 1C, fig. S3, and table S1). 
Whole-genome alignment revealed a low level 
of sequence conservation of these putative RREs 
compared with gene exons among multiple 
fish species (fig. S4). Furthermore, a relatively 
small portion of the RREs were linked to the 
same genomic loci with H3K4me3-marked 
active promoters in both species (310 genes), 
whereas most peaks were only detected in 
one species or the other (Fig. 1D, fig. S5, and 
table $2). Likewise, there were approximately 
twice as many regeneration-responsive genes 
detected by RNA sequencing (RNA-seq) in 
zebrafish (2829 up-regulated and 3363 down- 
regulated genes) than in killifish (1172 up- 
regulated and 1368 down-regulated genes) 
(Fig. 1E and table S3). Less than half of the 
detected regeneration-responsive genes were 
conserved, including 528 up-regulated and 546 
down-regulated genes [>1.5-fold or <-1.5-fold, 
false discovery rate (FDR) < 0.01; Fig. 1, E and 
F, and table $3]. Similar RNA-seq mapping 
rates and BUSCO scores (an assessment of the 
completeness of genome assembly) were ob- 
tained for both species (fig. S6), indicating that 
the substantial differences observed were un- 
likely to be caused by the differential quality of 
genome assembly. Although some identified 
H3K27ac peaks might derive from differences 
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Killifish regeneration 


Fig. 1. Evolutionary changes and maintenance of the cis-regulome of regen- 
eration in teleosts. (A) Regenerating caudal fins at 1 dpa and a phylogenetic tree 
showing the evolutionary relationships between the African killifish and the zebrafish. 
Scale bar, 200 um. (B) A tail blastema, indicated by the white arrow, forms by 1 dpa 
in both killifish and zebrafish. E-cadherin (ECAD) labels epithelial cells (red). Scale 
bar, 50 um. (C) Heatmaps of regeneration-responsive H3K27ac peaks (nonpromoter 
regions) in killifish and zebrafish. (D) Venn diagram showing 310 overlapping genes 
regulated by RREs between killifish and zebrafish. All genes have H3K4me3-marked 
active promoters that do not overlap with H3K27ac-defined RREs. P = 6.2e-98 


Killifish regeneration 


(hypergeometric test). (E) Large variations in the total number of regeneration- 
responsive genes (>1.5-fold or <-1.5-fold, FDR < 0.01) between killifish and 
zebrafish. (F) Heatmaps of H3K4me3 peaks linked to the 528 shared genes at 
1 dpa. (G) A conserved RRP is composed of 49 RRE-regulated genes (red) with 
H3K4me3-marked active promoters and elevated gene expression. Three genes with 
known functions in zebrafish regeneration are highlighted. (H) Heatmap showing the 
dynamic expression of 49 RRP genes during killifish fin regeneration. (I) Heatmap 
showing the dynamic changes of GO terms enriched by 528 shared up-regulated 
genes during killifish regeneration. The top GO terms are highlighted in red. 


in the cell type composition of the uninjured 
and regenerating tissues, the consistency of our 
results indicates that, compared with zebrafish, 
a relatively less complex genetic response to 
regeneration appears to be invoked by fin 
amputation in killifish. 

Next, we reasoned that if regeneration in 
killifish and zebrafish is driven by similar 
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mechanisms, then an evolutionarily main- 
tained genetic program activated by RREs is 
likely to be present. Comparing the 528 shared 
up-regulated genes with the 310 shared RRE- 
regulated genes, we identified 49 genes (P = 
1.1e-24, hypergeometric test) with H3K27ac- 
defined RREs, H3K4me3-marked active pro- 
moters, and elevated gene expression (Fig. 1, 
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G and H, and table S4). This shared cohort 
encompasses several known and essential 
regulators of zebrafish regeneration, includ- 
ing fgf20a, inhbaa, junbb, and fni (18-21), as 
well as putative new regulators such as cr/f1, 
ompl, and tgfbri1. Gene ontology (GO) term 
analyses of common RRE-regulated genes (7 = 
310) and up-regulated genes (n = 528) revealed 
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top GO terms associated with cell migration 
and cell motility (fig. S7A, Fig. 1D, and table S5), 
and cell division and cell cycle, respectively 
(Fig. 1, I and E, and table S5). Similar analyses 
of species-specific genes showed a species- 
dependent regulation of distinct biological pro- 
cesses during regeneration (fig. $7, B to E, and 
table S6). Our data uncovered not only large- 
scale differences in the activation of RREs and 
gene expression during early stages of regen- 
eration but also an evolutionarily conserved 
regeneration response program (RRP) acti- 
vated by RREs in fish subjected to markedly 
different selective pressures. 


Blastema cells are the primary source 
of RRP gene expression 


To identify the cells deploying the identified 
RRP, we performed single-cell RNA-seq (ScRNA- 
seq) of early killifish and zebrafish regener- 
ation (KR and ZR, respectively). Unsupervised 
analyses uncovered 13 clusters (KRO to KR12) 
from 7208 cells in killifish (Fig. 2A and fig. S8) 
and 16 clusters (ZRO to ZR15) from 8605 cells 
in zebrafish (Fig. 2B and fig. S9). Macrophages 
(KRO and KR1; ZRO, ZR3, ZR9, and ZR11), 
blastema cells (KR2, KR3, KR4, KR5, KR6, and 
KRU; ZR1, ZR2, ZR7, and ZR10), epidermal 
cells (KR7, KR9, and KR10; ZR4, ZR5, ZR6, 
ZR12, and ZR14), and neuronal cells (KR12 
and ZR13) were the shared cell types identi- 
fied (Fig. 2, C and D), whereas red blood cells 
(KR8), neutrophils (ZR8), and endothelial cells 
(ZR15) were only detected in one species, prob- 
ably because of low abundance (Fig. 2D and 
fig. SOK). The blastema cell clusters were de- 
fined by the known blastema markers msx 
homeobox genes (figs. S8F and S9F) (22). A 
new early blastema marker, fsi/7, was also iden- 
tified and confirmed by in situ hybridization 
(Fig. 2C). Using four different markers, cyclin 
A2, mki67, cyclin BI, and pena, we found that 
the cycling cells were mainly enriched in blas- 
tema cells and in subsets of epidermal cells and 
macrophages (fig. S10, A to D). Additionally, the 
blastema clusters identified can be categorized 
by the expression of two wnt genes (wnt5a and 
wntl0a) into two major groups with partial 
overlap in both species (fig. S10, E to J). 
The integrated single-cell analysis identi- 
fied both conserved (630 genes) and species- 
specific blastema marker genes (such as the 
previously identified zebrafish lepb gene) (Fig. 
2E, fig. S11, and table $7). Additionally, we ob- 
served some cell-type discrepancies of gene 
expression between killifish and zebrafish re- 
generation. For instance, complement factor d 
(cfd) was specifically expressed in killifish 
blastema cells, yet the expression of cfd was 
shifted to epidermal cells in zebrafish (fig. S11C). 
Consistent with GO term enrichment analysis 
(Fig. 1D, the expression of shared up-regulated 
genes (n = 528) was enriched (P < 0.01) in 
cycling cells (KR2, ZR1, and ZR14) (figs. S12, 
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Aand B, and S13, A and B). Among these genes, 
80 were specifically expressed in the blastema 
populations (Fig. 2F). The 49 RRP genes dis- 
played significant enrichment (P < 0.05) in 
blastemal clusters (KR2, KR3, KR4, KR5, KR6, 
and KR11; ZR1, ZR2, and ZR7) and basal epi- 
dermal cells (KR10 and ZR5) in both species 
(figs. S12, C and D, and S13, C and D; Fig. 2G; 
and fig. S14A). Our scRNA-seq data support 
the hypothesis that the identified RRP genes 
were mainly expressed in regeneration-specific 
cells, ie. blastema cells. 


Dysregulation of the RRP in animals 
with limited regeneration 


Next, we investigated whether changes in the 
regulation of RRP genes correlated with a var- 
jation of regenerative capacities in other ver- 
tebrates. We compared the RRP gene expression 
in published RNA-seq datasets for mouse spe- 
cies that respond to injury with either regen- 
eration (Acomys cahirinus) or scarring (Mus 
musculus) (23, 24). Twenty of 49 teleost-defined 
RRP genes were significantly up-regulated 
(>1.5-fold, FDR < 0.01) during ear pinna regen- 
eration in A. cahirinus (fig. S14, B and C). By 
contrast, their expression in the nonregener- 
ating ear pinna of M. musculus was dysregu- 
lated (Fig. 2H; fig. S14, B and C; and table S8). 
For example, cr/f1, itga4, and thal were signif- 
icantly up-regulated in A. cahirinus during 
regeneration but not in M. musculus during 
scarring. Moreover, the transforming growth 
factor-B (TGF-B) ligand inhba (i.e., activin A 
or activin) was highly and continuously ac- 
tivated during scarring in M. musculus but 
was only up-regulated during early stages of 
regeneration in A. cahirinus (Fig. 2H and fig. 
$14, B and C). This is consistent with reports 
that overexpression of inhba in mouse skin 
accelerates wound healing but enhances scar 
formation (25, 26). Similarly, dysregulation 
of RRP genes was also observed between skin 
regeneration and scarring (fig. S14, D to F, and 
table S8). The failed or altered activation of 
certain RRP genes during scarring suggests 
that teleost-defined RRP has likely been sub- 
jected to evolutionary changes in regeneration- 
competent and -incompetent animals. 


The RRE K-IEN directs gene activation after 
amputation and is essential for regeneration 


To test whether the identified enhancers play 
a role in regeneration, we validated five ChIP- 
identified RREs regulating inhba(2of2), fgf20 
(2o0f2), junb(of2), vmpl, and mbd2 in killifish 
(Fig. 3, A and B; fig. S15; and Fig. 1H). We fo- 
cused on the gene inhba because it is required 
in both tail and heart regeneration in zebrafish 
(19, 27) and is differentially regulated between 
regenerating and nonregenerating tissue (Fig. 
2H). Two copies of inhba exist in the genome 
of killifish, but only inhba(2of2) responded 
to amputation (fig. S16A). To characterize the 
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killifish inhba(2of2) enhancer, we cloned a 
1159-bp DNA sequence (referred to as K1159) 
marked by a H3K27ac peak upstream of the 
gene promoter into a transgenic vector with 
a green fluorescent protein (GFP) reporter 
and produced stable transgenic killifish (Fig. 
3C). Robust reporter expression was detected 
in the blastema region after fin amputation 
in K1159:GFP-transgenic fish (fig. SI6B). Sim- 
ilarly, we also observed amputation-activated 
GFP expression for four additional enhancers 
(fig. S15), supporting the validity of our ap- 
proach for identifying regeneration-activated 
enhancers. 

By generating four additional constructs 
with different truncations (Fig. 3C), we iden- 
tified a minimal sequence for the killifish 
inhba(2of2) enhancer (K-IEN), which recapitu- 
lated the original K7159:GFP expression and 
the endogenous inhba(2of2) expression (Fig. 
3, Aand D, and fig. S16C). We found that not 
all types of injury activated the identified en- 
hancer similarly. The most robust response 
was observed when the damage involved the 
regeneration of multiple tissues (e.g., bone and 
interray tissues) compared with only interray 
tissue removal, and noticeably less robust ex- 
pression was detected after performing a small 
incision without tissue loss (Fig. 3E). We also 
observed a stronger response in proximal am- 
putations compared with distal amputations 
(Fig. 3F). We conclude from these data that 
K-IEN directs gene expression in response to 
different types of injuries and positional cues. 

Because inhba is activated and required 
during zebrafish heart regeneration (27), we 
next investigated whether K-JEN also exhib- 
ited enhancer activity in killifish hearts. Sim- 
ilar to zebrafish heart regeneration (28), we 
observed a minor fibrotic scar at 7 days post- 
injury (dpi) and regression of the scar at 18 dpi 
through acid fuchsin orange G (AFOG) stain- 
ing (Fig. 4, A to C). Moreover, killifish cardio- 
myocytes maintained the ability to proliferate 
in response to injury (Fig. 4D). These results 
confirm that the killifish heart is regeneration 
competent. Upon heart resection of K-IEN:GFP 
killifish, we observed robust GFP activation 
in the regenerating heart tissue, which had yet 
to form fully differentiated cardiac myofibers 
as defined by the lack of differentiated cardiac 
muscle marker tropomyosin (Fig. 4E). By con- 
trast, the uninjured region (tropomyosin posi- 
tive) was devoid of detectable GFP expression. 
Additionally, the expression of GFP was not 
detected in the developing fins and hearts of 
K-IEN:GFP killifish (fig. S16, D to F). We con- 
clude that, as in caudal fin regeneration, the 
activation of K-IEN is regeneration dependent 
in the heart. 

To determine whether K-JEN is required for 
regeneration, we designed two guide RNAs to 
target K-JEN in killifish using the CRISPR- 
Cas9 approach (Fig. 4F). Disruption of K-IEN 
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Fig. 2. The RRP deployed by regeneration-specific cells is dysregulated in 
regeneration-incompetent animals. (A) T-distributed stochastic neighbor 
embedding (t-SNE) plot showing 13 different cell clusters identified in early KR; 
7208 cells were included in the analyses. (B) t-SNE plot showing 16 different 
cell clusters identified in early ZR; 8605 cells were included in the analyses. 
(C) Annotation of killifish and zebrafish cell clusters. The expression of fst/1 in 
the early killifish blastema cells was confirmed by in situ hybridization. 

White dashed line indicates the amputation site. (D) Integrated single-cell 
analysis between killifish and zebrafish. Left, annotation of major cell types. 
Right, percentage of cells contributed by killifish and zebrafish. (E) Expression 
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of shared and species-specific blastema marker genes identified in the 
integrated analysis. (F) Expression of 528 shared genes in different cell 

types identified by scRNA-seq; 80 genes were specifically detected in the 
blastema cells, and 232 genes were detected in two or more cell types. 

(G) Examples of the expression of RRP genes in t-SNE-clustered killifish cells. 
Only the enriched clusters are displayed for each gene. (H) Differential 
regulation of 12 teleost-defined RRP genes between regenerating ear pinna in 
the African spiny mouse A. cahirinus (blue) and nonregenerating ear pinna 

in the house mouse M. musculus (green). Four representative genes are 
highlighted in red. 
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Fig. 3. Regeneration-activated inhba expression is mediated through the 
RRE K-IEN. (A) The dynamic expression of inhba(Zof2) in killifish caudal fin 
regeneration. Right, the expression of inhba(2of2) in blastema cells at 2 dpa. 
(B) An RRE marked by H3K27ac peaks (red box) at the inhba locus in killifish and 
zebrafish. (C) Transgenic constructs examined for regeneration-dependent 
expression in killifish caudal fin. Top, design of a Tol2 transgenic vector. Con- 
structs marked with green (K1159, K-S3, and K-S4) display enhancer activity in 
fin tissue. K-IEN (K-S4) is the minimal enhancer. (D) Images from the transgenic 


reporter line K-IEN:GFP. Left, expression of GFP at O and 2 dpa. Right, costaining 
of GFP (green) and E-cadherin (red) on 2 dpa cryosections. Scale bar, 50 wm. 
(E) Expression of K-IEN:GFP in different types of injury. Tissues were removed 

by a 1l-mm-diameter biopsy punch. Top, the damaged regions at O dpa are 
outlined (red). Bottom, GFP expression in the damaged regions (star) at 1 dpa. 
(F) Expression of K-IEN:GFP in response to proximal and distal amputation. 
The orientation of all caudal fin images is proximal to the bottom and distal to 
the top. Dashed line indicates the amputation site. 


significantly delayed tail regeneration in ho- 
mozygous mutants compared with wild-type 
animals (Fig. 4G and fig. S17, A and B). Fur- 
thermore, heart regeneration was also im- 
paired, leading to a failure of scar resolution 
at the injury site (Fig. 4H and fig. S17, C and 
D). However, cardiomyocyte proliferation was 
not affected in the mutants (Fig. 41. Our data 
reveal that K-JEN is an RRE with pleiotropic 
function and is required for tissue regener- 
ation in killifish. 


Detecting evolutionary changes of an essential 
RRE in vertebrates 


To determine whether RREs with essential re- 
generation roles are evolvable, we sought to 
identify orthologs of K-IEN in different verte- 
brate lineages using mVISTA (29). Multiple 
sequence alignments detected a relatively con- 
served noncoding block near the inhba loci in 
killifish, zebrafish, and human (Fig. 5A). The 
overlap between the predicted zebrafish se- 
quence and the H3K27ac-marked region sug- 
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gests that the predicted enhancers are likely to 
be biologically relevant. We cloned DNA frag- 
ments containing the predicted zebrafish and 
human elements and generated stable trans- 
genic reporter lines in killifish for each of them. 
The zebrafish inhba enhancer (Z-IEN) drove 
regeneration-dependent GFP expression in a 
manner indistinguishable from that of K-IEN 
(Fig. 5, B and C). By contrast, the expression of 
GFP directed by the predicted human inhba 
enhancer (H-JEN) was barely detectable be- 
fore 2 dpa but was robustly observed by 3 dpa 
and persisted to 5 dpa (Fig. 5D and fig. S16B). 
Unlike K-IEN:GFP and Z-IEN:GFP, the expres- 
sion of H-IEN:GFP was restricted to the basal 
epidermal cells rather than to the mesenchy- 
mal cells (Fig. 5D), reminiscent of the activa- 
tion of inhba in human and mouse skin upon 
injury (26, 30). We also observed amputation- 
dependent activation of GFP expression for Z- 
IEN:GFP, but not H-IEN:GFP, during killifish 
heart regeneration (Fig. 5E). Instead, H-IEN 
directed GFP expression during homeostasis 
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in the endocardium cells and some epicar- 
dium cells (Fig. 5F). The ability to rescue the fin 
regeneration phenotype in K-JEN’ 7~ mutants 
through reexpression of killifish inhba driven 
by Z-IEN, but not H-IEN, implies a functional 
change of the enhancer (fig. S18). These results 
suggest that an ancestral, evolutionarily con- 
served teleost RRE with an indispensable role 
in regeneration has diversified its functions, 
implicating RRE evolutionary turnover as a 
potential mechanism underlying variation 
in the regenerative capacities of vertebrates. 


Genomic occupancy of AP-1 motifs is essential 
for RRE activities 


To investigate what determines the injury re- 
sponsiveness in the identified RREs, we per- 
formed motif enrichment analyses on both 
sets of conserved and species-specific elements. 
We found that a consensus 12-O-tetradecanoyl- 
phorbol-13-acetate responsive element (TRE), 
TGA[G/C]TCA, which was recognized by the 
AP-1 transcription factor complex, was the 
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most enriched motif in all analyses performed 
(Fig. 6, A and B, and fig. S19, A to C). Similarly, 
AP-1 motifs were enriched in the open chro- 
matin regions involved in Drosophila imaginal 
disc regeneration and regeneration in the 
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Fig. 4. The RRE K-IEN is required for regeneration. (A to C) AFOG staining 
on cryosections of uninjured (A), 7 dpi (B), and 18 dpi (C) killifish hearts to detect fibrin 
(red) and collagen (blue). Diagram in (A) shows the resection of killifish heart 
ventricle. Magnified views of collagen deposition (white arrowhead) in the injured 
region are outlined with a dashed box. n = 10. (D) PCNA (red) and DAPI (blue) staining 
on cryosections of uninjured (top) and 5 dpi (bottom) killifish hearts. n = 10. 

(E) Expression of K-IEN:GFP in 7 dpi killifish hearts. Top, merge of GFP, TPM1, and 
DAPI images. The uninjured region is marked by tropomyosin (TPM1). A magnified 
view of GFP is outlined with a dashed box. n = 5. (F) Generation of homozygous 
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acoel worms (fig. S19, D and E) (3/, 32). The 
AP-1 complex is a heterodimer composed of 
members from different families of DNA- 
binding proteins, including the Jun, Fos, ATF, 
JDP, and Maf families (33). AP-1 binds both 


TRE and the cAMP response element (CRE; 
TGACGTCA) (34). Cell-type-biased expression 
of AP-1 components was detected in both kil- 
lifish (fig. S20, A to E) and zebrafish (fig. S21) 
scRNA-seq of blastema formation, indicating 
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K-IEN mutants. Top, schematic diagram showing the disruption of K-IEN through 
CRISPR-Cas9. Bottom, PCR genotyping of a homozygous K-IEN“~ mutant. (G) Fin 
regeneration is significantly delayed in K-IEN~ mutants. Right, quantification of the 
regenerated tissue at 3 dpa. n = 10. **P < 0.01. (H) AFOG staining on cryosections 
of K-IEN” mutant hearts at 18 dpi. n = 10. (I) Injury-triggered cardiomyocyte pro- 
liferation was not altered in the K-IEN~ mutant at 5 dpi. The percentages of myocardial 
nuclei undergoing DNA replication (PCNA staining) at the injury site were quantified. 
n=10.***P < 0.001. ns., not significant (P > 0.05). Student's t test was performed in 
the results shown in (G) and (I). Dashed line indicates the injury site. 
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Fig. 5. Evolutionary changes of K-IEN activities in vertebrates. (A) VISTA 


alignment of inhba loci among killifish, zebrafish, and 


represent high levels of sequence conservation, and the absence of peaks 
indicates no significant conservation. The killifish RRE is marked in green. 
Bottom, schematic diagram showing the overlap between the zebrafish H3K27ac 
peak and the predicated enhancer (blue). (B) GFP expression driven by the 


zebrafish enhancer Z-/EN at 2 dpa in killifish caudal fi 


that different cell populations may form dis- 
tinct AP-1 complexes. Further, genome-wide 
prediction of AP-1 motifs among different spe- 
cies showed that CRE motifs recognized by the 
Jun family proteins (Jun, JunB, and JunD) exist 
at a much higher frequency in regeneration- 
competent fish genomes than in human and 
mouse genomes (fig. S22). Taken together, 
these results identify AP-1-binding sites as a 
shared characteristic of all RREs identified in 
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this study and uncover differences in the fre- 
quency of predicted AP-1-binding motifs be- 
tween regeneration-competent and -incompetent 
animals. 

To determine whether AP-1 motifs are es- 
sential for the activity of RREs, we identified 
predicted AP-1-binding sites in both K-IEN 
(GCTGACTCAGA and GCTGACTCACTG) and 
Z-IEN (GCTGACTCAT and GCTGACTCTA) and 


subjected them to site-specific mutagenesis 
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Z-IEN:GFP under different types of injury in killifish caudal fin. (D) GFP expression 
driven by the human enhancer H-/EN initially detected at 3 dpa (middle) in 


s detected in the basal epidermal cells (arrow). 

nt expression of Z-IEN:GFP at 7 dpi in killifish hearts. 
with a dashed box. (F) The expression of H-IEN:GFP is 
is and is not regeneration dependent. Dashed line 


indicates the injury or amputation site. Scale bar, 50 um. 


(Fig. 6C). All motifs were mutated into GCA- 
AAAAAAAA or GCAAAAAAAA (Fig. 6, C to E). 
Stable transgenic reporter assays revealed 
that the expression of GFP driven by either the 
K-IEN™”- or Z-IEN™“”-mutated enhancers was 
completely abolished compared with the orig- 
inal enhancers (Fig. 6, C to E). Furthermore, 
blocking the activation of the AP-1 complex 
through the JNK pathway inhibitor SP600125 
diminished the activity of K-IEN and inhibited 


7 of 9 


RESEARCH | RESEARCH ARTICLE 


A Motif enrichment B Motif enrichment Cc 
(conserved RREs) ia (Species-specific RREs) AP-1 motifs 
€@ 100 : i) AP-1 300 a na 
| © Q 100. 
ZleEs 200. : 1, GCTGACTCAGA 
S ge 50, - a ’ ‘np i K-IEN 2, GCTGACTCACT 
OF: ebiltiaEn chess... attinheas en ww o- 1, 48 bp 
0 25 50 75 0 25 50 75 1 2 
ee 00° 
S| es" ae 750. : AP-1 1, GCTGACTCAT 
‘w] Eo 1 500. 2, GCTGACTCTA 
Fol Boas : . catia 594b 
deeaet oo 2 P 
N| "7 0, <fdieatelPabesidares 99), iitdEDna mate on: 
0 25 50 75 0 25 50 75 
% of Target Sequences % of Target Sequences 


Fig. 6. Occupancy of AP-1-binding motifs is essential for RRE activities. 

(A and B) Motifs enriched in the conserved (A) and species-specific (B) RREs 
identified in killifish (top) and zebrafish (bottom) caudal fin regeneration. 

AP-1 motifs are highlighted in red. Each dot in the graph represents a single 
binding motif. The sequence of AP-1 motifs is shown in fig. S19. (C) Identification 
of AP-1 motifs in the RREs K-IEN and Z-IEN. (D and E) Expression of GFP 
driven by K-IEN? and Z-IEN"”? is abolished at 2 dpa in transgenic reporter lines. 
Right, quantification of the fluorescence intensity between wild-type and 
mutant enhancers. P < 0.001 (Student's t test). n = 10. Dashed lines indicate 


tail regeneration (fig. S20, F and G). We con- 
clude that AP-1 motifs are required for the 
activation of RREs in response to amputation. 


Discussion 


Activation or inactivation of genes is suspected 
to underlie changes in regenerative capacities, 
yet how these genetic activities are regulated 
remains poorly understood. AP-1 transcription 
factors are essential for many biological pro- 
cesses, and their diverse functions are part of 
complex dynamic networks of signaling path- 
ways known to depend on subunit composition 
and interactions with other nuclear factors, 
which in turn are in part determined by both 
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cell type and cellular environment (33). In this 
study, AP-1 components were not ubiquitously 
expressed in all cell types and were present in 
both mesenchymal and epithelial cells, suggest- 
ing that specific subunit compositions may be 
required to restrict expression of the enhancers 
to either the mesenchyme (K-IEN, Z-IEN) or 
epithelial cells (H-ZEN). Additionally, the pres- 
ence of a predicted p53/p63-binding motif 
in the human enhancer, which was absent in 
K-IEN and Z-IEN (fig. S23A), and the abun- 
dance of p63 expression in basal epidermal 
cells (fig. S23B) suggest that interactions of 
AP-1 with other nuclear factors may also play 
a role in regulating enhancer activity. 
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Given the ancient evolutionary origin of the 
AP-1 complex (35, 36), we hypothesize that the 
ancestral function of AP-1 motif-enriched en- 
hancers was to activate a regenerative response, 
and that through the course of evolution and 
speciation, regeneration and injury responses 
became dissociated from each other in some, 
but not all, enhancers (Fig. 6F). Repurposing of 
ancestral regulatory sequences to generate new 
regulatory functions is not without precedent 
(37) and has been well documented in both 
vertebrates (38) and invertebrates (39). For in- 
stance, frequent regulatory element repurpos- 
ing was revealed by DNase I-hypersensitive 
sites in the mouse and human genomes (40). 
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These studies demonstrated that regulatory 
elements in orthologous loci were functionally 
active in distinct tissues, indicating that cis- 
regulatory plasticity may be a key facilitator of 
vertebrate evolution (40). Future experiments 
aimed at determining the in vivo composition 
of the AP-1 complexes associated with both the 
evolutionarily conserved RREs and the species- 
specific injury response enhancers may not 
only help to identify mechanisms underpinning 
enhancer repurposing but also help to resolve 
the long-standing problem of why some species 
can regenerate missing body parts after am- 
putation whereas others cannot. 


Material and methods summary 


Bulk RNA-seq and ChIP-seq (H3K27ac and 
H3K4me3) data were obtained from ampu- 
tation sites at O dpa (control) and 1 dpa for 
transcriptomic and epigenomic analyses of 
blastema formation in African killifish and 
zebrafish. Regeneration time-course RNA-seq 
was performed at 3, 6, and 14 hours postampu- 
tation and at 1, 2, 3, 4, 7, and 18 dpa in African 
killifish. These data were used to define the 
RREs and genes and to identify an evolution- 
arily maintained RRP. scRNA-seq data were 
obtained from regenerating blastema at 1 dpa 
and used to determine cell types deploying the 
identified RRP. To characterize RREs, trans- 
genic reporter assays were performed in African 
killifish. The function of the killifish inhba en- 
hancer was determined using CRISPR-Cas9- 
mediated genome engineering. The human 
inhba enhancer was identified using the mVISTA 
tool. Motif analysis was used to identify key 
transcription factor-binding sites enriched by 
ChIP-identified RREs. The function of these 
binding sites was validated using site-specific 
mutagenesis followed by transgenic reporter 
assays. 
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Regulation of sleep homeostasis mediator adenosine 
by basal forebrain glutamatergic neurons 
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INTRODUCTION: Sleep homeostasis, the balance 
between the duration of sleep and wakefulness, 
is a fundamental feature of the sleep-wake 
cycle. During wakefulness, sleep-promoting 
somnogenic factors accumulate and cause 
an increase in sleep pressure or our need for 
sleep. Decades of research have identified many 
genes, molecules, and biochemical processes 
involved in the regulation of sleep homeostasis. 
Among various processes implicated in sleep 
homeostasis, adenosine—a critical component 
of the cell metabolic pathway—is a prominent 
physiological mediator of sleep homeostasis. 
Adenosine released in the basal forebrain (BF), 
a brain region that plays a critical role in reg- 
ulating the sleep-wake cycle, can suppress neu- 
ral activity mediated by the Al receptor and 
increase sleep pressure. In addition, the sleep- 
wake cycle is controlled by different patterns 
of neural activity in the brain, but how this 
neural activity contributes to sleep homeosta- 
sis remains mostly unclear. In this study, we 
examine the neural control of sleep homeosta- 
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sis by investigating in detail the mechanisms 
underlying adenosine increase in the BF. 


RATIONALE: Because the traditional microdial- 
ysis measurement of adenosine concentration 
has a poor temporal resolution, we first designed 
a genetically encoded G protein-coupled recep- 
tor (GPCR)-activation-based (GRAB) sensor for 
adenosine (GRABaq,), in which the amount 
of extracellular adenosine is indicated by the 
intensity of fluorescence produced by green 
fluorescent protein (GFP) (see the figure, panel A). 
Using the GRABago, we first measured the dy- 
namics of extracellular adenosine concentrations 
during the sleep-wake cycle in the mouse BF. 
We then used a simultaneous optical record- 
ing of the Ca”* activity in different BF neurons 
and the change in adenosine concentrations 
to examine the correlation between adenosine 
increase and neural activity. We further studied 
the ability of different BF neurons in controlling 
the adenosine release using optogenetic activa- 
tion. Finally, we used cell type-specific lesion to 


B Optogenetic activation 


Neural control of rapid adenosine dynamics and sleep homeostasis. (A) Simultaneous optical recording of 
the Ca* activity and adenosine concentration using GCaMP and GRABag, reveals neural activity-dependent 
rapid adenosine dynamics in the mouse basal forebrain (BF) during the sleep-wake cycle. (B) Optogenetic 
activation of BF glutamatergic neurons evokes a robust increase of extracellular adenosine. (C) Cell type-specific 
lesion of BF glutamatergic neurons significantly increases wakefulness. 
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confirm the contribution of BF neurons in con- 
trolling the increase in adenosine concen- 
trations and examine its contribution to the 
sleep homeostasis regulation. 


RESULTS: We found that the amount of extra- 
cellular adenosine was high during wakeful- 
ness and low during non-rapid eye movement 
(NREM) sleep. Benefiting from the high tem- 
poral resolution of the GRABaqo, we also found 
a prominent increase in adenosine during REM 
sleep and revealed rapid changes in adenosine 
concentrations during brain state transitions. 
Simultaneous fiber photometry recording of the 
Ca?" activity in different BF neurons and the 
change in extracellular adenosine concentra- 
tions showed that both cholinergic neurons and 
glutamatergic neurons had highly correlated ac- 
tivity with changes in the adenosine concentra- 
tion (see the figure, panel A). In examining the 
time course of the two signals, we found that neu- 
ral activity always preceded changes in ade- 
nosine dynamics by tens of seconds. When we 
measured the evoked adenosine release by op- 
togenetic activation of these two types of neu- 
rons using their physiological firing frequencies, 
we found that the activation of BF cholinergic 
neurons only produced a moderate increase in 
extracellular adenosine; by contrast, the activa- 
tion of BF glutamatergic neurons caused a large 
and robust increase (see the figure, panel B). 
Finally, we selectively ablated BF glutamatergic 
neurons and found a significantly reduced in- 
crease in the amounts of extracellular adeno- 
sine. Also, mice with a selective lesion of BF 
glutamatergic neurons showed impaired sleep 
homeostasis regulation, with significantly in- 
creased wakefulness during the active period 
(see the figure, panel C). 


CONCLUSION: Here, we report the design and 
characterization of a genetically encoded aden- 
osine sensor with high sensitivity and specificity, 
and high temporal resolution; using the sensor, 
in combination with fiber photometry recording, 
optogenetic activation, and cell type-specific le- 
sion, we demonstrate a neural activity-dependent 
rapid dynamics of the extracellular adenosine 
concentration during the sleep-wake cycle in the 
mouse BF and uncover a critical role of the BF 
glutamatergic neurons in controlling adenosine 
dynamics and sleep homeostasis. These find- 
ings suggest that cell type-specific neural ac- 
tivity during wakefulness can contribute to the 
increase in sleep pressure by stimulating the 
release of somnogenic factors. 
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Sleep and wakefulness are homeostatically regulated by a variety of factors, including adenosine. 
However, how neural activity underlying the sleep-wake cycle controls adenosine release in the brain 
remains unclear. Using a newly developed genetically encoded adenosine sensor, we found an activity- 
dependent rapid increase in the concentration of extracellular adenosine in mouse basal forebrain (BF), a 
critical region controlling sleep and wakefulness. Although the activity of both BF cholinergic and 
glutamatergic neurons correlated with changes in the concentration of adenosine, optogenetic activation 
of these neurons at physiological firing frequencies showed that glutamatergic neurons contributed 
much more to the adenosine increase. Mice with selective ablation of BF glutamatergic neurons 
exhibited a reduced adenosine increase and impaired sleep homeostasis regulation. Thus, cell type- 
specific neural activity in the BF dynamically controls sleep homeostasis. 


omeostatic regulation is a fundamental 

phenomenon of the sleep-wake cycle, 

and sleep-promoting somnogenic fac- 

tors accumulate during wakefulness, 

thereby inducing sleep (J, 2). Several 
extracellular or cytoplasmic factors and asso- 
ciated biochemical processes that contribute to 
this phenomenon have been identified (3-5). 
In addition, different patterns of neural activ- 
ity in the brain control the sleep-wake cycle, 
but how this neural activity contributes to 
sleep homeostasis remains largely unknown 
(6). Among various processes implicated in 
controlling sleep homeostasis (3, 7, 8), the 
release of adenosine in the basal forebrain 
(BF) is a prominent physiological mediator 
of sleep homeostasis (9-12). In this study, we 
used a genetically encoded adenosine sensor 
to examine in detail the mechanisms under- 
lying the increase in adenosine concentration 
in the BF, a brain region that plays a critical 
role in regulating the sleep-wake cycle (13, 14). 


Development and characterization of a 
genetically encoded adenosine sensor 


To record the dynamics of extracellular aden- 
osine level in the BF during the sleep-wake 
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cycle with high temporal resolution and high 
specificity and sensitivity, we designed a ge- 
netically encoded G protein-coupled receptor 
(GPCR)-activation-based (GRAB) sensor for 
adenosine (GRABagqo), in which the amount of 
extracellular adenosine is indicated by the in- 
tensity of fluorescence produced by green flu- 
orescent protein (GFP) (Fig. 1A). 

The sensor was developed by using an es- 
tablished GRAB sensor development pipeline 
(15-20): We first screened candidate sensor 
scaffolds by inserting a conformational-sensitive 
circularly permuted enhanced GFP (cpEGFP) 
into different adenosine receptors using linker 
peptides (fig. S1A); we then selected an Ay, re- 
ceptor (Ay,R)-based chimera (GRABaqoo.1) for 
further optimization because of its good mem- 
brane trafficking and high fluorescence response 
upon adenosine application (fig. SIB); we next 
systematically optimized the length and amino 
acid composition of the linkers between the 
AgaR and the cpEGFP and identified a sensor 
with the largest fluorescence response (fig. S1, 
C and D), which we named GRABago10 (here- 
after referred to as Adol.0). 

In human embryonic kidney 293T (HEK293T) 
cells, Ado1.0 showed good membrane trafficking 
and produced a 120% peak response (change 
in fluorescence intensity, AF/F,) to the applica- 
tion of a saturated concentration of adenosine 
(100 uM) (Fig. 1B); by contrast, a non-ligand- 
binding mutant form of the sensor [F168A (27); 
GRABagdotomut or Adol.Omut for short] showed 
no detectable response (Fig. 1B and fig. S2E). 
Adol.0 had rapid response kinetics, with a rise 
time constant (to,) of 68 + 13 ms (Fig. 1C). In 
neurons, Ado1.0 was widely distributed through- 
out the membrane, including the soma, axons, 
and dendrites (Fig. 1D and fig. S2A), and re- 
sponded to adenosine application in a dose- 
dependent manner (Fig. 1E and fig. S2C), with 
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a median effective concentration (EC;,) of 
~60 nM (Fig. 1F). Ado1.0 responded to aden- 
osine with high selectivity, because it showed 
an undetectable or much weaker response to 
several structurally similar derivatives of aden- 
osine, such as adenosine 5°-diphosphate (ADP), 
adenosine 5°-triphosphate (ATP), inosine, and 
adenine (Fig. 1F and fig. S2B). In addition, 
adenosine-induced fluorescence response can 
be blocked by the AgaR antagonist SCH-58261 
(fig. S2, B to D). 

Next, we examined whether Adol1.0 expres- 
sion affects cellular physiology. Using a lucif- 
erase complementation assay, we found that 
Ado1.0 had almost no downstream G, coupling, 
in contrast to the robust coupling produced 
by cells expressing AgaRs (Fig. 1H, middle, 
and fig. SIE, middle). Similarly, we found no 
detectable downstream cyclic adenosine 3” ,5°- 
monophosphate (cAMP) activation induced by 
the Aj,R agonist HENECA in Ado1.0 (Fig. 1H, 
right, and fig. S1E, right). Consistent with the 
minimum activation of intracellular signal- 
ing pathways, Ado1.0 showed no detectable 
internalization, as we observed no significant 
decrease of fluorescence in Adol.0-expressing 
cells when applying a high concentration of 
adenosine (10 uM) for 2 hours (Fig. 1G and fig. 
$2D). Finally, there was no difference in either 
field stimulation-evoked Ca”* signaling (Fig. 11 
and fig. S3) or K*-evoked glutamate release (fig. 
S3) between Adol1.0-expressing neurons and 
nontransfected neurons, suggesting that ex- 
pression of Adol.0 did not measurably alter 
Ca** signaling or neurotransmitter release. 

Together, these results show that Adol1.0 can 
detect rapid dynamics of extracellular adeno- 
sine levels with high sensitivity and specificity; 
at the same time, its expression has no detec- 
table effect on cell physiology. 


Dynamics of extracellular adenosine in the 
sleep-wake cycle 


We next examined the dynamics of extracel- 
lular adenosine concentration in the BF during 
the sleep-wake cycle using the GRABaqo sensor. 
We injected an adeno-associated virus (AAV) 
expressing GRABago into the BF and measured 
the fluorescence signal using fiber photometry 
through an implanted optical fiber (Fig. 2A 
and fig. S4); as an internal control (e.g., for 
correcting movement artifacts), we coexpressed 
a red fluorescence protein mScarlet, which is 
insensitive to changes in adenosine concentra- 
tion. The adenosine signal was then extracted 
from the measured fluorescence signals using 
a blind source separation method (22). 

We observed significantly higher amounts 
of extracellular adenosine when the mice were 
awake, as compared with that during non- 
rapid eye movement (NREM) sleep (Fig. 2B, C, 
and E; P = 3.6 x 10°), consistent with previous 
microdialysis measurements (9, 10). Such a dif- 
ference was not observed in mice expressing a 
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Fig. 1. Design and characteri- 
zation of genetically 
encoded adenosine sensors. 
(A) Schematic drawing 
depicting the principle of the 
GRABadgo sensors. The third 
intracellular loop of the human 
AoaR was replaced with 
cpEGFP; thus, the binding of 
adenosine induces a conforma- 
tional change that increases 
the cpEGFP fluorescence. D 
(B) Expression and responses 
of the GRABagoi.g and 
GRABado1.omut in HEK293T cells. 
(Left) Images of sensor 
fluorescence before and after 
application of 100 uM Ado 
(scale, 10 um). (Middle and 
right) Time course and 
summary of peak AF/Fo; 

n = 20 cells from two cultures. 
(C) Rise and decay time 
constants of the Adol.0 
fluorescence in response to 
the application of Ado (100 uM) 
followed by the AzaR antagonist 
SCH-58261 (200 uM). n = 10 
and 4 cells, respectively. 

(D) Expression of Adol.0 in 
cultured neurons (scale bars, 


hAzaR 


30 yum). (E) Response of Adol.0 in cultured neurons; n = 28 to 30 regions of interest (ROIs) in three cul 
neurons in response to Ado, ADP, and ATP; n = 20 ROls each. (G) Normalized AF/Fo of Adol.0-expressing 
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nonbinding mutant form of the adenosine sen- 
sor (fig. S6 and Fig. 2D; Wake versus NREM: 
P = 0.08; REM versus NREM: P = 0.94). 
Benefiting from the high temporal resolution 
of the GRABago sensor, we also observed a 
significantly higher amount of adenosine 
during REM sleep than during both wakeful- 
ness and NREM sleep (Fig. 2, B, C and E; REM 
versus Wake: P = 0.002; REM versus NREM: 
P =1.7x 107°). Such an increase during REM 
sleep could not be measured using micro- 
dialysis because of the short duration of REM 
sleep in mice (9, 23). Furthermore, our mea- 
surements showed rapid change in the aden- 
osine level as mice transitioned between three 
different brain states (Fig. 2, B and F), reveal- 
ing an average rise time of 29.3 + 3.6 s (mean + 
SEM). This rapid change in extracellular aden- 
osine levels suggests a neural activity-dependent 
release (24, 25). 


Neural control of fast adenosine transients in 
the BF during the sleep-wake cycle 


Next, we examined how the activity of differ- 
ent neuronal types in the BF contributes to the 
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observed adenosine dynamics during the 
sleep-wake cycle. The BF neural circuits for the 
sleep-wake regulation have been character- 
ized (7, 8, 26): Cholinergic neurons (expressing 
choline acetyltransferase, ChAT) and glutama- 
tergic neurons (expressing vesicular glutamate 
transporter 2, VGLUT2) are highly active during 
both wakefulness and REM sleep, and optoge- 
netic activation of these two cell types promotes 
wakefulness (1/4). We first examined the role of 
BF ChAT+ neurons by measuring the correla- 
tion between the activity of ChAT+ neurons and 
the change in extracellular adenosine concen- 
tration. We injected AAV expressing GRABago 
into the BF of one hemisphere and AAV express- 
ing the Cre-dependent Ca?* indicator GCaMP6s 
(27) into the contralateral BF of the ChAT-Cre 
mice (28), and measured the fluorescence sig- 
nal using fiber photometry 2 weeks after injec- 
tion (Fig. 3A and fig. $7). This “bilateral dual 
probes” method allowed us to simultaneously 
measure the signals of two interfering fluores- 
cent probes in the same brain region. The pop- 
ulation Ca?* activity of ChAT+ neurons had a 
time course similar to that of the extracellu- 
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lar adenosine increase (Fig. 3B). The size of 
GRABago event strongly correlated with the 
size of the corresponding GCaMP event (Fig. 
3C; Pearson’s r = 0.83, P < 0.0001), and such 
correlation was not observed when the GCaMP 
signals were randomly shuffled (Fig. 3D; 
Pearson’s 7 = 0.06, P = 0.32). Moreover, the 
change in population Ca”* activity mea- 
sured in ChAT+ neurons often preceded the 
change in the adenosine signal by ~24 s (Fig. 
3E), suggesting that BF ChAT+ neurons may 
control the amount of extracellular adeno- 
sine (29, 30). 

We therefore optogenetically activated ChAT+ 
neurons and used fiber photometry to measure 
the evoked adenosine release by coinjecting 
AAVs expressing GRABag, and Cre-dependent 
red-shifted channelrhodopsin, ChrimsonR (37), 
into the BF of ChAT-Cre mice (Fig. 3F and fig. 
S8). Activating ChAT+ neurons (638 nm laser, 
10 ms/pulse, 10 Hz for 8 s) induced a slight but 
significant increase in extracellular adenosine 
(Fig. 3, Gto I; peak signal: P = 0.014; Dru. 
P = 0.023), indicating that ChAT+ neurons 
may indeed provide some contribution to the 
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Fig. 2. Adenosine dynamics in the mouse basal 


forebrain during the sleep-wake cycle. (A) Schematic 


diagram depicting fiber photometry recording of 


extracellular adenosine during the sleep-wake cycle in 


freely moving mice. (B) (Top to bottom) EEG power 


spectrogram; electromyogram (EMG) (scale, 0.5 mV); 
ratio between EEG theta power (@) and delta power (8) 


(scale, 2); GRABado fluorescence (scale, 1 z-score). 


Ado 


GRABaco 


The brain states (fig. S5) are color-coded; the same color 


code is used in all the following figures. (©) GRABago 


fluorescence in different brain states. Each line represents 


data from one recording. n = 11 sessions from four mice. 


***P < 0.001 (Student's paired t test); Wake versus 
NREM: P = 3.6 x 10-®; REM versus NREM: P = 1.7 x 


10°. In this and all subsequent figures, summary data 
are expressed as the mean + SEM. (D) Fluorescence of 
the mutant sensor in different brain states. n = 7 sessions 
not significant (Wilcoxon signed- 

: P = 0.08; REM versus 
ormalized modulation of GRABado 
signal in REM (R) — NREM (NR) versus Wake (W) — NREM. 
symbol represents one recording, and the gray 


from four mice. n.s., 
rank test); Wake versus NRE 
NREM: P = 0.94. (E) 


Each 


Wake 


6/5 


shaded box indicates a <2-fold signal change between the 


indicated brain states. (F) Signal of the GRABag. sensor 
mutant sensor during brain state transitions. The 


and 
vertical dashed lines represent the transition time. 


n = 35, 104, 148, and 29 events (in four mice) for each 


panel, respectively. 


observed changes in extracellular adenosine 
during the sleep-wake cycle. However, the am- 
plitude of evoked adenosine changes was 
highly variable in different trials (Fig. 3G), sug- 
gesting that other cell types in the BF may also 
play a role in regulating extracellular adeno- 
sine during the sleep-wake cycle. 

We thus further examined the role of BF 
VGLUT2+ neurons, using the same “bilateral 
dual probes” method described above for ex- 
pressing GRABago and GCaMP6s in the BF of 
VGLUT2-Cre mice (32) (Fig. 4A and fig. S9). 
The Ca?* signal measured in VGLUT2+ neu- 
rons was significantly correlated with extra- 
cellular adenosine (Fig. 4, B to D; Pearson’s r = 
0.64, P < 0.0001) and preceded the GRABago 
signal by ~41 s (Fig. 4E). Further optogenetic 
activation of VGLUT2+ neurons (Fig. 4F and 
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fig. S10) at a physiologically relevant frequency 
of 20 Hz for 8 s (/4) induced a large and re- 
producible increase in extracellular adenosine 
(Fig. 4, G to I; signal peak: P = 0.036; Upiyo.: P = 
8 x 10°). This release was much larger than 
that induced by activating ChAT+ neurons 
(Fig. 41; ChAT+ versus VGLUT2+, signal peak 
(z-score): 0.76 + 0.20 versus 2.56 + 0.21, P = 
0.0022; Xruo(z-score): 73 + 22 versus 208 + 
17, P = 0.0011). The laser-induced increase in 
extracellular adenosine was likely due to direct 
activation of V@LUT2+ and ChAT+ neurons 
rather than nonspecific effects of the laser 
(e.g., local heating), because no significant 
laser-evoked fluorescence was observed in 
mice expressing only GRABag, but without 
ChrimsonkR (fig. S11; P = 0.45). The difference 
that we observed between adenosine release 
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100 
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100 -100 


evoked by the activation of ChAT+ or V@LUT2+ 
neurons most likely reflects their in vivo abil- 
ity to control the adenosine dynamics during 
the sleep-wake cycle, because we used physi- 
ological firing rates for optogenetic activa- 
tion of ChAT+ and VGLUT2+ neurons (10 and 
20 Hz, respectively) (14, 33). 

We next measured the adenosine transients 
after selectively ablating VGLUT2+ neurons 
in the BF using Caspase-3 to drive cell type- 
specific apoptosis (34). We coinjected AAVs ex- 
pressing GRABago and Cre-dependent Caspase-3 
in the BF of VGLUT2-Cre mice (Fig. 4J). Two 
weeks after injection, when a significant re- 
duction in the number of VGLUT2+ neurons 
was observed in the BF (fig. S12), we found a 
significantly reduced extracellular adenosine 
increase during both wakefulness and REM 
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Fig. 3. Calcium activity in A 
BF cholinergic neurons 
correlates with changes in 
extracellular adenosine. 

(A) Schematic diagram 
depicting fiber photometry 
recording of extracellular 
adenosine levels and population 
Ca** activity of ChAT+ neurons. 
(B) (Top to bottom) EEG power 
spectrogram, EMG (scale, 

0.1 mV), GCaMP fluorescence 
(scale, 1 z-score), and GRABado 
fluorescence (scale, 1 z-score). 
(C) Correlation between the 
size of GCaMP and GRABado 
events. The red line represents 
a linear fit. n = 224 events from 
nine recordings in three mice. 
Pearson's r = 0.83, P < 0.0001. 
The correlation coefficient 

was calculated using raw data 
(rather than using data after log 
transformation); the scatter 
plot is on a logio scale for F 
better visualization; thus, data 
points near zero may not be 

visible; the same analysis was 
applied in Fig. 3D and Fig. 4, 

C and D. (D) Same as in (C) 

after the GCaMP signal was 
randomly shuffled. Pearson's 

r = 0.06, P = 0.32. (E) Time 

course of the GCaMP and 

GRABado signal aligned to the 

onset (left) or offset (right) 

of the GRABago events. 

(F) Schematic diagram depict- 

ing fiber photometry recording 


DIO- fin 
GCaMP6s 


10 


DIO- 


ChAT-Cre 


> Fluor.- Ado (z-score) 


[ Shuffled control 


€ 
N =) 
fal <j 
3 E 
oO 3° 
Oo z / 
8 1 P r N l Ory f l 
10? 108 im 10 10? 108 -100 0 100 -100 0 100 
WN > Fluor- Ado (z-score) Time from Ado event (s) 
G ChAT+ activation Ho, 
“ : a 
| 82F n=6 
Hk 0 NO — 
3 Bol 
oo a 1 1 1 
= Oo 200 
Time from laser onset (s) 
mn mn nn @ I 
(o} s 
is) ~4 _~ 300 
@ © g 
& 8 3] 8 
6 B ® 200 . 
& Bop, & 
Sol, -_ x 3 100 
-50 0 50 100 ° Ty Hi Z 
Time from laser onset (s) ) No 


of extracellular adenosine levels induced by optogenetic activation of ChAT+ neurons. (G@) GRABado signals evoked by optogenetic activation of ChAT+ neurons 
(638 nm laser, 10 ms/pulse, 10 Hz for 8 s). (Upper panel) Heat map plot of nine successive trials; (lower panel) averaged signal; red line, start of the laser 
train. (H) Group summary of laser-evoked GRABado signals. Gray, data of individual recording; green, group average. (I) Quantification of laser-evoked adenosine 
signals in (H). (Left) Peak amplitude (P = 0.014, Student's t test); (right) integrated signal area (P = 0.023, Student's t test.). 


sleep, as compared to control mice (Fig. 4, K 
and L; VGLUT2 lesion versus no-lesion, Wake 
(normalized AF/Fo, norm. z-score): 1.6 + 0.7 
versus 5.3 + 0.6, P = 0.002; NREM: 0.4 + 0.2 
versus 1.0 + 0.1, P = 0.017; REM: 1.4 + 1.0 versus 
7.1 + 0.8, P = 0.0003), further supporting the 
role of BF V@LUT2+ neurons in controlling the 
increase in extracellular adenosine. 


Loss of BF glutamatergic neurons impairs 
sleep homeostasis 


The increase in adenosine during physiologi- 
cal or prolonged wakefulness has been sug- 
gested to powerfully control sleep homeostasis 
(9, 11, 12). Animals with lower activation of 
adenosine signaling may have a slower buildup 
of sleep pressure and exert increased wakeful- 
ness and faster recovery from prolonged wake- 
fulness (35, 36). Loss of V@LUT2+ neurons in 
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the BF (e.g., through selective ablation) might 
thus alter sleep homeostasis. 

To test this hypothesis, we bilaterally ab- 
lated VGLUT2+ neurons in the BF using the 
same method as described above and measured 
changes in the sleep-wake behavior (Fig. 5A). 
Mice with ablated BF VGLUT2+ neurons spent 
significantly more time in wakefulness com- 
pared to littermate controls (Fig. 5, B and C, 
and fig. S13; Lesion versus Control, time in 
wakefulness (%): 66.8 + 1.5 versus 57.5 + 2.6, 
P = 0.0092). This difference was primarily due 
to increased wakefulness specifically during the 
active period (i.e., nighttime), with no signifi- 
cant difference during the inactive period (i.e., 
daytime) (Fig. 5C; Lesion versus Control, time 
in wakefulness (%): nighttime, 91.3 + 1.6 versus 
76.3 + 5.0, P = 0.0041; daytime, 42.2 + 2.2 versus 
38.6 + 0.97, P = 0.16). We observed no apparent 
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difference in the quality of the sleep or wake- 
fulness in the lesion group, as measured by the 
power of electroencephalogram (EEG) slow- 
wave activity (SWA, 0.5 to 4 Hz) during NREM 
sleep or theta activity (6 to 10 Hz) during active 
wakefulness (37) (fig. S14; EEG SWA, P = 0.87; 
EEG theta, P = 0.44), suggesting that the ob- 
served increase in wakefulness in the lesion 
group was not caused by distorted patterns of 
brain oscillations. 

Another measurement of impaired sleep ho- 
meostasis regulation is the change in recovery 
sleep after prolonged wakefulness (J, 38). We 
thus examined whether the loss of V@LUT2+ 
neurons in the BF affects recovery sleep. Mice 
were kept in wakefulness by gentle handling 
(.e., sleep deprivation, SD) in their home cages 
for 6 hours starting from the beginning of the 
light-on period, and recovery sleep was then 
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Fig. 4. Glutamatergic neurons 
in the BF contribute to the 
increase in extracellular 
adenosine during the sleep- 
wake cycle. (A) Schematic 
diagram depicting fiber 
photometry recording of extra- 
cellular adenosine levels and 
population Ca** activity of 
VGLUT2+ neurons. (B) (Top 

to bottom) EEG power spectro- 
gram, EMG (scale, 0.2 mV), 
GCaMP fluorescence (scale, 

1 z-score; light blue, smoothed 
signal), and GRABado fluores- 
cence (scale, 1 z-score). (€ and 
D) Same as Fig. 3, C and D, 
respectively. n = 233 events 
from 10 recordings in six mice. 
In (C), Pearson's r = 0.64, 

P < 0.0001; in (D), Pearson's 

r = -0.07, P = 0.3. (E) Same 
as Fig. 3E. (F to H) Same 

as Fig. 3, F to H, respectively, 
except that VGLUT2+ neurons 
are stimulated. The decrease in 
GRABaao signal after the laser 
onset was independent of 
extracellular adenosine because 
it cannot be blocked by an 
antagonist of the GRABado 
sensor (45) and may be caused 
by activity-dependent PMCA 
effect (46). (I) Quantification of 
laser-evoked adenosine signals. 
Data of the ChAT+ group are 
the same as those in Fig. 3l. 
(Left) P < 0.005 (Wilcoxon 
rank-sum test); (right) 

P < 0.002 (Student's t test). 
(J) Schematic diagram 
depicting the strategy used to 
selectively ablate VGLUT2+ 
neurons in the BF using 

a Cre-dependent Caspase-3. 

(K and L) Summary of GRABado 
signal in mice with ablation 

of VGLUT2+ neurons (K) 
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(n = 8 recordings from eight mice) and in control mice (L) (n = 18 recordings from 18 mice). Lesion versus Control: Wake, P = 0.002 (Wilcoxon rank-sum test); NREM: 
P = 0.017 (Student's t test); REM: P = 0.0003 (Student's t test). 


measured during the subsequent 18 hours 
(Fig. 5D and fig. S15) (39). Although both 
lesion and control mice showed compensatory 
sleep rebound (fig. S16), there was significantly 
less NREM sleep in the lesion group during ZT 
12-16 (Fig. 5, D and E; Lesion versus Control, 
time in NREM (%): 12.9 + 4.0 versus 35.2 + 3.9, 
P = 0.0017), but not for ZT 6-12 (Fig. 5D and E; 
P = 0.90). We also analyzed the time course of 
the NREM SWA and NREM percentage after 
SD by fitting the hourly data with an expo- 
nential decay function (38). The lesion group 
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exhibited a tendency of faster decline in the 
NREM SWA (fig. S17; decline coefficients: 
Lesion, —0.061 + 0.011; Control, —0.033 + 0.012; 
P = 0.11) and a significantly faster decline in 
the NREM sleep time (Fig. 5F and G; decline 
coefficients: Lesion, -0.18 + 0.011; Control, 
—0.094 + 0.014; P = 2.7 x 107%). 


Discussion 


Here, we reported the design and character- 
ization of a genetically encoded adenosine 
sensor (GRABag,) with high sensitivity and 
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specificity, and high temporal resolution; by 
combining the GRABa,, and fiber photometry 
imaging with bilateral dual probes, together 
with optogenetic manipulation and cell type- 
specific lesion, we demonstrated a neuronal 
type-specific control of fast adenosine dynam- 
ics during the sleep-wake cycle and uncovered 
a critical role of the VGLUT2+ neurons in the 
BF in controlling adenosine dynamics and 
sleep homeostasis. 

The cell type-specific control of extracellular 
adenosine levels in the BF suggests a distinct 
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Fig. 5. Loss of BF VGLUT2+ A 
neurons impairs sleep 
homeostasis. (A) Schematic 
diagram depicting the strategy 
used to selectively ablate 
VGLUT2+ neurons in the BF. 
(B) Circadian variation of 
wakefulness in lesion and 
control mice. n = 7 mice per 
group. *P < 0.05, **P < 0.01, 
and ***P < 0.001 (Wilcoxon 


rank-sum test or Student's aul 
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lesion and control mice were subjected to sleep deprivation (SD) for 6 hours (ZTO-6). n = 7 mice per group. *P < 0.05 and **P < 0.01 (Wilcoxon rank-sum 

test or Student's t test). (E) Summary of the percentage of NREM sleep during the two selected periods. **P < 0.01 and nss., not significant. ZT6-12, P = 0.90 
(Wilcoxon rank-sum test); ZT12-16, P = 0.0017, (Student's t test). (F) Time course showing the decay in the percentage of NREM sleep in each hour after SD using the 
same data in (D). Dashed lines, the exponential fit of data from individual mice; solid line, group average. (G) Fitting coefficients from the data in (F) (smaller 
coefficient means a faster decline in hourly sleep percentage). P = 2.7 x 10°“ (Student's t test). 


contribution to the sleep-wake regulation by 
different wake-active neurons in the BF. The 
long delay in the increase of extracellular 
adenosine following neural activation may 
provide a time window for the activation of 
BF neural circuits, while still maintaining 
a feedback inhibition (40) for stabilizing the 
network. 

Our results that the loss of BF V@LUT2+ 
neurons have a larger effect on the balance 
between the duration of sleep and wakeful- 
ness, but not the SWA or the recovery sleep, 
are consistent with the notion that adenosine 
regulation of SWA is primarily caused by its 
direct modulation of neural activity in the 
thalamocortical system (12, 41, 42). This result 
suggests a dissociation in the regulation of dif- 
ferent features of the sleep-wake cycle. 

Together, our findings offer new insights 
into the mechanisms by which neural activity 
during wakefulness contributes to the increase 
in sleep pressure (43, 44) by stimulating the 
release of somnogenic factors. 


Materials and Methods 
Design and characterization of GRABygo 


The cDNAs encoding various subtypes of 
Ado receptor were amplified from the human 
GPCR cDNA library, and the third intracel- 
lular loop (ICL3) of each receptor was replaced 
with the ICL3 of GRAByg. The insertion sites 
on AsaR and the amino acid composition be- 
tween A»aR and ICL3 of GRAByg were sys- 
tematically screened to obtain GRABago10- 
GRABago1.0 Was then expressed in HEK293T 
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cells, HeLa cells, or cultured rat primary neu- 
rons for further characterization of its sensitivity 
and specificity, its coupling with intracellular 
signaling pathways, and effects of its expres- 
sion on neuronal physiology. 


Animals and surgical procedures 


All animal experimental procedures followed 
guidelines of the National Institutes of Health 
and were approved by the Animal Care and 
Use Committee at Peking University, or the 
Institute of Neuroscience, Chinese Academy of 
Sciences. Both male and female mice (>7 weeks 
at the time of surgery) were used for in vivo 
experiments. AAV virus (0.2 to 0.4 ul) was 
stereotaxically injected into the BF using a 
glass pipette micro-injector through a craniot- 
omy. For fiber photometry and optogenetic 
activation experiments, optical fibers were 
inserted into the BF using the same coordi- 
nate for virus injections. EEG and EMG elec- 
trodes were attached according to standard 
procedures. All implants were secured using 
dental cement. Experiments were carried out 
at least 1 week after surgery. 


Polysomnography recordings 


The EEG and EMG signals were recorded 
using TDT amplifiers with a high-pass filter 
at 0.5 Hz and digitized at 1500 Hz. The brain 
states were scored every 5 s semi-automatically 
in MATLAB using fast Fourier transform (FFT) 
spectral analysis with a frequency resolution 
of 0.18 Hz, and the results were validated 
manually by trained experimenters accord- 
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ing to established criteria. For recording with 
fiber photometry, experiments were carried 
out in home cages, and each session lasted 
~3 hours. For long-term recording, mice were 
connected to the system using a commutator 
and habituated for at least 3 days before a 
3-day recording period, followed by 6-hour sleep 
deprivation and recovery sleep for 30 hours. 
Sleep deprivation was achieved using gentle 
handling methods. 


Fiber photometry recording and analysis 


GRABago and GCaMP fluorescence was recorded 
using fiber photometry with lock-in detec- 
tion. The fiber photometry rig was built using 
parts from Doric Lens, and the lock-in detec- 
tion was implemented in the TDT RZ2 sys- 
tem using the fiber photometry “Gizmo” of the 
Synapse software. The demodulated signal 
was low-pass filtered at 20 Hz and stored using 
a sampling frequency of 1017 Hz. To analyze 
the photometry data, we first down-sampled 
the raw data to 1 Hz and subtracted the back- 
ground autofluorescence. We then calculated 
the AF/Fp using a baseline obtained by fitting 
the autofluorescence-subtracted data with a 
second-order exponential function. Finally, 
we used a MATLAB script “BEADS” with a cut- 
off frequency of 0.00035 cycles per sample 
to remove the slow drift and identify fast com- 
ponents. To quantify the GRABaqo signal across 
different animals, the z-score transformed 
AF/Fo was further normalized using the stan- 
dard deviation of the signal during NREM 
sleep. 
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Statistics 


A normality test was first performed on each 
dataset using the Shapiro-Wilk test. The para- 
metric tests were used if the dataset was nor- 
mally distributed; otherwise, nonparametric 
tests were used. All the statistical tests were 
two-tailed and performed in MATLAB. Data 
in the fiber photometry experiments were ex- 
cluded based on post-hoc verification of the 
virus expression and the position of optical 
fibers. In the long-term polysomnographic re- 
cording experiments, one mouse was excluded 
for analysis because of abnormal EEG and EMG 
signals. The investigators were not blinded to 
the genotypes or the experimental conditions 
of the animals. 


Further details of the materials and methods 


can be found in the supplementary materials. 
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Deep immune profiling of COVID-19 patients reveals 
distinct immunotypes with therapeutic implications 
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Jennifer E. Wu*, Cécile Alanio* et al. 


INTRODUCTION: Many patients with corona- 
virus disease 2019 (COVID-19), caused by severe 
acute respiratory syndrome coronavirus 2 (SARS- 
CoV-2) infection, present with severe respiratory 
disease requiring hospitalization and mechanical 
ventilation. Although most patients recover, dis- 
ease is complex and case fatality can be as high 
as 10%. How human immune responses con- 
trol or exacerbate COVID-19 is currently poorly 
understood, and defining the nature of immune 
responses during acute COVID-19 could help 
identify therapeutics and effective vaccines. 


RATIONALE: Immune dysregulation during SARS- 
CoV-2 infection has been implicated in patho- 
genesis, but currently available data remain 
limited. We used high-dimensional cytometry 
to analyze COVID-19 patients and compare 
them with recovered and healthy individuals 
and performed integrated analysis of ~200 im- 
mune features. These data were combined with 
~50 clinical features to understand how the 
immunology of SARS-CoV-2 infection may be 
related to clinical patterns, disease severity, and 
progression. 


High-dimensional immune response 
analysis of COVID-19 patients iden- 


tifies three immunotypes. Peripheral n=60 

blood mononuclear cell immune Recovered 

profiling and clinical data were Donors rcmiRe Ge 
collected from 60 healthy donors ais 

(HDs), 36 recovered donors (RDs), COVID-19 “an. 
and 125 hospitalized COVID-19 iar 


patients. High-dimensional flow 
cytometry and longitudinal analysis 
highlighted stability and fluctuations in 
the response. UMAP visualization 
distilled ~200 immune features into 
two dimensions and identified three 
immunotypes associated with clinical 
outcomes. cTfh, circulating T follicular 
helper cells; EMRA, a subset of 
effector memory T cells reexpressing 
CD45RA; dO, day 0. 


Component 2 


High-Dimensional 


RESULTS: Analysis of 125 hospitalized COVID-19 
patients revealed that although CD4 and CD8 
T cells were activated in some patients, T cell 
responses were limited in others. In many pa- 
tients, CD4 and CD8 T cell proliferation (mea- 
sured by K167 increase) and activation (detected 
by CD38 and HLA-DR coexpression) were con- 
sistent with antiviral responses observed in other 
infections. Plasmablast (PB) responses were pres- 
ent in many patients, reaching >30% of total B 
cells, and most patients made SARS-CoV-2- 
specific antibodies. However, ~20% of patients 
had little T cell activation or PB response com- 
pared with controls. In some patients, responses 
declined over time, resembling typical kinetics 
of antiviral responses; in others, however, robust 
T cell and PB responses remained stable or in- 
creased over time. These temporal patterns were 
associated with specific clinical features. With an 
unbiased uniform manifold approximation and 
projection (UMAP) approach, we distilled ~200 
immune parameters into two major immune 
response components and a third pattern lacking 
robust adaptive immune responses, thus reveal- 
ing immunotypes of COVID-19: (i) Immunotype 1 


Healthy : HD 
Donors : 


Severity 


Clinical Data 
~50 features 


High-Dimensional Flow Cytometry 
& Longitudinal Monitoring 
27 parameters, ~200 features 


was associated with disease severity and showed 
robust activated CD4 T cells, a paucity of circulating 
follicular helper cells, activated CD8 “EMRAs,” hy- 
peractivated or exhausted CD8 T cells, and PBs. (ii) 
Immunotype 2 was characterized by less CD4 T cell 
activation, Tbet* effector CD4: and CD8 T cells, and 
proliferating memory B cells and was not associated 
with disease severity. (iii) Inmunotype 3, which neg- 
atively correlated with disease severity and lacked 
obvious activated T and B cell responses, was also 
identified. Mortality occurred for patients with all 
three immunotypes, illustrating a complex relation- 
ship between immune response and COVID-19. 


CONCLUSION: Three immunotypes revealing dif- 
ferent patterns of lymphocyte responses were 
identified in hospitalized COVID-19 patients. 
These three major patterns may each represent 
a different suboptimal response associated with 
hospitalization and disease. Our findings may 
have implications for treatments focused on ac- 
tivating versus inhibiting the immune response. 
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Deep immune profiling of COVID-19 patients reveals 
distinct immunotypes with therapeutic implications 


Divij Mathew:*, Josephine R. Giles'?3*, Amy E. Baxter’?*, Derek A. Oldridge’**, 

Allison R. Greenplate’*, Jennifer E. Wu*”**, Cécile Alanio"?**, Leticia Kuri-Cervantes*°, 

M. Betina Pampena*, Kurt D’Andrea®, Sasikanth Manne?, Zeyu Chen?, Yinghui Jane Huang*?, 
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Coronavirus disease 2019 (COVID-19) is currently a global pandemic, but human immune responses to 
the virus remain poorly understood. We used high-dimensional cytometry to analyze 125 COVID-19 
patients and compare them with recovered and healthy individuals. Integrated analysis of ~200 immune 
and ~50 clinical features revealed activation of T cell and B cell subsets in a proportion of patients. 

A subgroup of patients had T cell activation characteristic of acute viral infection and plasmablast 
responses reaching >30% of circulating B cells. However, another subgroup had lymphocyte activation 
comparable with that in uninfected individuals. Stable versus dynamic immunological signatures were 
identified and linked to trajectories of disease severity change. Our analyses identified three 
immunotypes associated with poor clinical trajectories versus improving health. These immunotypes 
may have implications for the design of therapeutics and vaccines for COVID-19. 


he coronavirus disease 2019 (COVID-19) 

pandemic has, to date, caused >23 million 

infections resulting in more than 800,000 

deaths. After infection with severe acute 

respiratory syndrome coronavirus 2 
(SARS-CoV-2), COVID-19 patients can experi- 
ence mild or even asymptomatic disease or 
can present with severe disease requiring 
hospitalization and mechanical ventilation. 
The case fatality rate can be as high as ~10% 
(1). Some severe COVID-19 patients display 
acute respiratory distress syndrome (ARDS), 
which reflects severe respiratory damage. In 
acute respiratory viral infections, pathology 
can be mediated by the virus directly, by an 
overaggressive immune response, or both 
(2-4). However, in severe COVID-19, the 
characteristics and role of the immune re- 
sponse, as well as how these responses relate 
to clinical disease features, remain poorly 
understood. 


SARS-CoV-2 antigen-specific T cells have 
been identified in the central memory (CM), 
effector memory (EM), and CD45RA* effec- 
tor memory (EMRA) compartments (5), but 
the characteristics of these cells and their role 
in infection or pathogenesis remain unclear. 
Recovered individuals more often have evidence 
of virus-specific CD4 T cell responses than virus- 
specific CD8 T cell responses, though preexisting 
CD4 T cell responses to other coronaviruses also 
are found in a subset of people in the absence 
of SARS-CoV-2 exposure (6). Inflammatory 
responses—such as increases in interleukin- 
6 (IL-6)-producing or granulocyte-macrophage 
colony-stimulating factor (GM-CSF)-producing 
CD4 T cells in the blood (7) or decreases in 
immunoregulatory subsets such as regulatory 
T cells (Tyeg) or y5 T cells (8-17)—have been 
reported. T cell exhaustion (72, 73) and in- 
creased inhibitory receptor expression on 
peripheral T cells have also been reported 


(7, 14), though these inhibitory receptors are 
also increased after T cell activation (75). Al- 
though there is evidence of T cell activation in 
COVID-19 patients (6), some studies have found 
decreases in polyfunctionality (12, 17) or cyto- 
toxicity (12), but these changes have not been 
observed in other studies (13). How this activa- 
tion should be viewed in the context of COVID- 
19 lymphopenia (78-20) remains unclear. 

Most patients seroconvert within 7 to 
14 days of infection, and increased plasma- 
blasts (PBs) have been reported (16, 21-23). 
However, the role of humoral responses in 
the pathogenesis of COVID-19 is still unclear. 
Whereas immunoglobulin G (IgG) levels re- 
portedly drop slightly ~8 weeks after symptom 
onset (24, 25), recovered patients maintain 
high spike protein-specific IgG titers (6, 26). 
IgA levels also can remain high and may cor- 
relate with disease severity (25, 27). Further- 
more, neutralizing antibodies can control 
SARS-CoV-2 infection in vitro and in vivo 
(4, 28, 29). Indeed, convalescent plasma that 
contains neutralizing antibodies can improve 
clinical symptoms (30). However, not all patients 
that recover from COVID-19 have detectable 
neutralizing antibodies (6, 26), which suggests 
a complex relationship between humoral and 
cellular response in COVID-19 pathogenesis. 

Taken together, this previous work provokes 
questions about the potential diversity of im- 
mune responses to SARS-CoV-2 and the rela- 
tionship of this diversity to clinical disease. 
However, many studies describe small cohorts 
or even single patients, thus limiting a com- 
prehensive investigation of this diversity. The 
relationship of different immune response 
features to clinical parameters, as well as the 
changes in immune responses and clinical 
disease over time, remains poorly understood. 
Because potential therapeutics for COVID-19 
patients include approaches to inhibit, acti- 
vate, or otherwise modulate immune function, 
it is essential to define the immune response 
characteristics related to disease features in 
well-defined patient cohorts. 


Acute SARS-CoV-2 infection in humans 
results in broad changes in circulating 
immune cell populations 


We conducted an observational study of 
hospitalized patients with COVID-19 at the 
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and quantification of major immune subsets. (A) Overview of patient cohorts —_ regions represent the clinical normal range and normal range based on study 
in our study, including HDs, RDs, and COVID-19 patients. (B) Quantification of HDs, respectively. The vertical dashed line indicates the clinical threshold for 
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(D) Representative flow cytometry plots and (E) frequencies of major immune unpaired Wilcoxon test with Benjamini-Hochberg (BH) correction: *P < 0.05, 
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University of Pennsylvania (UPenn IRB 808542) 
that included 149 adults with confirmed SARS- 
CoV-2 infection (i.e, COVID-19 patients) (Fig. 
1A). Blood was collected at enrollment (typically 
~24 to 72 hours after admission). Additional 
samples were obtained from patients who 
remained hospitalized on day 7 (D7). Blood 
was also collected from nonhospitalized 
patients who had recovered from documented 
SARS-CoV-2 infection [recovered donors (RDs); 
n = 46], as well as from healthy donors (HDs; 
n = 70) (UPenn IRB 834263) (Fig. 1A). Clinical 
metadata are available from the COVID-19 
patients over the course of disease (table S1). 
Flow cytometry data from peripheral blood 
mononuclear cells (PBMCs), as well as clinical 
metadata, were collected from a subset of pa- 
tients and donors: COVID-19 patients (n = 
125), RDs (n = 36), and HDs (n = 60) (Fig. 1A 
and tables S2 to S4). 

COVID-19 patients had a median age of 60 
and were significantly older than HDs and 
RDs (median ages of 41 and 29, respectively), 
though the age distributions for all three 
cohorts overlapped (Fig. 1A and fig. S1A). For 
COVID-19 patients, median body mass index 
was 29 (range: 16 to 78), and 68% of these 
patients were African American (table S2). 
Comorbidities in COVID-19 patients were domi- 
nated by cardiovascular risk factors (83% of 
the cohort). Nearly 20% of patients suffered 
from chronic kidney disease, and 18% had a 
previous thromboembolic event. A subset of 
patients (18%) were immunosuppressed, 
and 7 and 6% of patients were known to have 
a diagnosis of cancer or a preexisting pulmo- 
nary condition, respectively. Forty-five percent 
of the patients were treated with hydroxy- 
chloroquine (HCQ), 31% with steroids, and 
29% with remdesivir. Eighteen individuals died 
during their hospital stay or within 30 days of 
admission. The majority of the patients were 
symptomatic at diagnosis and were enrolled 
~9 days after initiation of symptoms. Approx- 
imately 30% of patients required mechani- 
cal ventilation at presentation, with additional 
extracorporeal membrane oxygenation in 
four cases. 

As has been reported for other COVID-19 
patients (37), this COVID-19 cohort presented 
with a clinical inflammatory syndrome. C-reactive 
protein (CRP) was elevated in more than 90% 
of individuals and lactate dehydrogenase and 
D-dimer were increased in the majority, whereas 
ferritin was above normal in ~75% of COVID- 
19 patients (Fig. 1B and fig. SIB). Similarly, 
troponin and NT-proBNP were increased in 
some patients (fig. SIB). IL-6 levels, measured 
in a subset of patients, were normal in 5 patients, 
moderately elevated in 5 patients (6 to 20 pg/ml), 
and high in 31 patients (21 to 738 pg/ml) (fig. 
S1B). Although white blood cell (WBC) counts 
were mostly normal, individual leukocyte pop- 
ulations were altered in COVID-19 patients (Fig. 
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1B). A subset of patients had high polymorpho- 
nuclear leukocyte (PMN) counts (fig. S1B), as 
described previously (8, 32) and in a companion 
study (33). Furthermore, approximately half of 
the COVID-19 patients were clinically lymphopenic 
(absolute lymphocyte count <1000/ul; Fig. 1B). 
By contrast, monocyte, eosinophil, and basophil 
counts were mostly normal (Fig. 1B and fig. S1B). 

To examine potential associations between 
these clinical features, we performed corre- 
lation analysis (Fig. 1C and fig. S1C). This 
analysis revealed correlations between differ- 
ent COVID-19 severity metrics, as well as clinical 
features or interventions associated with more- 
severe disease (e.g., D-dimer, vasoactive med- 
ication) (Fig. 1C and fig. SIC). WBCs and PMNs 
also correlated with metrics of disease severity 
(e.g., APACHE III) as well as with IL-6 levels 
(Fig. 1C and fig. S1C). Other relationships were 
also apparent, including correlations between 
age or mortality and metrics of disease severity 
and many other correlations between clinical 
measures of disease, inflammation, and comor- 
bidities (Fig. 1C and fig. SIC). Thus, COVID-19 
patients presented with varied preexisting 
comorbidities, complex clinical phenotypes, 
evidence of inflammation in many patients, 
and clinically altered leukocyte counts. 

To begin to investigate immune responses 
to acute SARS-CoV-2 infection, we compared 
PBMCs of COVID-19 patients, RDs, and HDs 
by using high-dimensional flow cytometry. We 
first focused on the major lymphocyte pop- 
ulations. B cell and CD3 T cell frequencies 
were decreased in COVID-19 patients com- 
pared with HDs or RDs, reflecting clinical 
lymphopenia, whereas the relative frequency 
of non-B and non-T cells was correspondingly 
elevated (Fig. 1, D and E). Although a numerical 
expansion of a non-B, non-T cell type is pos- 
sible, loss of lymphocytes likely results in an 
increase in the relative frequency of this popu- 
lation. This non-B, non-T cell population is also 
probed in more detail in the companion study 
(33). Examining only CD3 T cells revealed 
preferential loss of CD8 T cells compared with 
CD4 T cells (Fig. 1, F and G, and fig. S1D); this 
pattern was reflected in absolute numbers 
estimated from the clinical data, where both 
CD4 and CD8 T cell counts in COVID-19 pa- 
tients were lower than the clinical reference 
range, though the effect was more prominent 
for CD8 T cells (49 of 61 individuals with 
below-normal levels) than for CD4 T cells (38 
of 61 individuals with below-normal levels) 
(fig. SIE). These findings are consistent with 
previous reports of lymphopenia during COVID- 
19 (17-20) but highlight a preferential impact 
on CD8 T cells. 

We next asked whether the changes in these 
lymphocyte populations were related to clin- 
ical metrics (Fig. 1H). Lower WBC counts were 
associated preferentially with lower frequen- 
cies of CD4 and CD8 T cells and increased 
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non-T, non-B cells, but not with B cells (Fig. 
1H). These lower T cell counts were associated 
with clinical markers of inflammation, includ- 
ing ferritin, D-dimer, and high-sensitivity CRP 
(hsCRP) (Fig. 1H), whereas altered B cell fre- 
quencies were not. Thus, hospitalized COVID- 
19 patients present with a complex constellation 
of clinical features that may be associated with 
altered lymphocyte populations. 


SARS-CoV-2 infection is associated with CD& 
T cell activation in a subset of patients 


We next applied high-dimensional flow cyto- 
metric analysis to further investigate lymphocyte 
activation and differentiation during COVID-19. 
We first used principal components analysis 
(PCA) to examine the general distribution of 
immune profiles from COVID-19 patients (7 = 
118), RDs (n = 60), and HDs (n = 36) using 
193 immune parameters identified by high- 
dimensional flow cytometry (tables S5 and S6). 
COVID-19 patients were clearly separated from 
RDs and HDs in PCA space, whereas RDs and 
HDs largely overlapped (Fig. 2A). We inves- 
tigated the immune features that drive this 
COVID-19 immune signature. Given the role 
of CD8 T cells in response to viral infection, 
we focused on this cell type. Six major CD8 
T cell populations were examined by using 
the combination of CD45RA, CD27, CCR7, and 
CD95 cell surface markers to define naive 
(CD45RA*CD27'CCR7‘CD95 ), central memory 
[CD45RA CD27°CCR7* (CM)], effector memory 
[CD45RA CD27'CCR7 (EM1), CD45RA CD27— 
CCR7* (EM2), CD45RA CD27 CCR7 (EM3)], 
and EMRA (CD45RA*CD27 CCR7 ) (Fig. 2B) 
CD8 T cells. Among the CD8 T cell populations, 
there was an increase in the EM2 and EMRA 
populations and a decrease in EM1 (Fig. 2C). 
Furthermore, the frequency of CD39" cells was 
increased in COVID-19 patients compared with 
HDs (Fig. 2D). Although the frequency of PD-1* 
cells was not different in the total CD8 popu- 
lation (Fig. 2D), it was increased for both 
CM and EM1 (fig. S2A). Finally, all major CD8 
T cell naive and memory populations in RDs 
were comparable to those in HDs (Fig. 2, C 
and D, and fig. S2A). 

Most acute viral infections induce prolifer- 
ation and activation of CD8 T cells detectable 
by increases in KI67 or coexpression of CD38 
and HLA-DR (34, 35). There was a significant 
increase in KI67* and also HLA-DR*CD38* 
non-naive CD8 T cells in COVID-19 patients 
relative to HDs or RDs (Fig. 2, E and F). In 
COVID-19 patients compared with HDs and 
RDs, KI67* CD8 T cells were increased across 
all subsets of non-naive CD8 T cells, including 
CM and EM1 populations (fig. S2B). These data 
indicate broad T cell activation, potentially 
driven by bystander activation and/or homeo- 
static proliferation in addition to antigen-driven 
activation of virus-specific CD8 T cells. This 
activation phenotype was confirmed by HLA-DR 
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for CD8 T cell subsets. (C) Frequencies of CD8 T cell subsets as indicated. 

(D) Frequencies of PD-1* and CD39* cells. Frequencies of (E) KI67* and 

(F) HLA-DR*CD38" cells and representative flow cytometry plots. The green line 

in the left panels denotes the upper decile of HDs. (G) (Top) Global viSNE projection 
of non-naive CD8 T cells for all participants pooled, with non-naive CD8 T cells from 
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(MFI) as indicated (column-scaled z-scores). (J) Percentage of non-naive CD8 T cells 
from each cohort in each FlowSOM cluster. Boxes represent interquartile ranges 
(IQRs). (C, D, E, F, and J) Each dot represents an individual HDs (green), RDs (blue), 
or COVID-19 patient (red). Significance was determined by unpaired Wilcoxon test 
with BH correction: *P < 0.05, **P < 0.01, ***P < 0.001, and ****P < 0.0001. 
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and CD38 coexpression that was signifi- 
cantly increased for all non-naive CD8 T cell 
subsets (Fig. 2F and fig. S2C). However, the 
magnitude of the KI67* or CD38*HLA-DR* 
CD8 T cells varied widely in this cohort. The 
frequency of KI67* CD8 T cells correlated 
with the frequency of CD38*HLA-DR* CD8 
T cells (fig. S2D). However, the frequency of 
CD38*HLA-DR* T cells, but not KI67* CD8 
T cells, was elevated in COVID-19 patients who 
had concomitant infection with another mi- 
crobe but was not affected by preexisting immu- 
nosuppression or treatment with steroids (fig. 
S2E). Moreover, these changes in CD8 T cell 
subsets in COVID-19 patients did not show 
clear correlations with individual metrics of 
clinical disease such as hsCRP or D-dimer (fig. 
S2E), although the frequency of KI67* CD8 
T cells was associated with elevated IL-6 and 
ferritin levels. Although CD8 T cell activation 
was common, ~20% of patients had no increase 
in KI67* or CD38*HLA-DR* CD8 T cells above 
the level found in HDs (Fig. 2, E and F). Thus, 
although robust CD8 T cell activation was a 
clear characteristic of many hospitalized COVID- 
19 patients, a substantial fraction of patients 
had little evidence of CD8 T cell activation in 
the blood compared with controls. 

To gain more insights, we applied global 
high-dimensional mapping of the 27-parameter 
flow cytometry data. A t-distributed stochastic 
neighbor embedding (tSNE) representation of 
the data highlighted key regions of non-naive 
CD8 T cells found preferentially in COVID-19 
patients (Fig. 2G). A major region of this tSNE 
map present in COVID-19 patients, but not 
HDs or RDs, encompasses CD8 T cells enriched 
for expression of CD38, HLA-DR, KI67, CD39, 
and PD-1 (Fig. 2G), highlighting the coexpres- 
sion of these activation markers with other 
features, including CD95 (i.e., FAS). Notably, 
although non-naive CD8 T cells from RDs 
were highly similar to those from HDs, subtle 
differences existed, including in the lower re- 
gion highlighted by T-bet and CX3CR1 (Fig. 2G). 
To further define and quantify these differences 
between COVID-19 patients and controls, we 
performed FlowSOM clustering (Fig. 2H) and 
compared expression of 14 CD8 T cell markers 
to identify each cluster (Fig. 21). This approach 
identified an increase in cells in several clus- 
ters, including clusters 1, 2, and 5 in COVID-19 
patients, reflecting CD45RA*CD27 CCR7~ 
EMRA-like populations that expressed CX3CRI1 
and varying levels of T-bet (Fig. 2, I and J) 
(“EMRA” denotes a subset of effector mem- 
ory T cells reexpressing CD45RA). Clusters 12 
and 14 contained CD27* HLA-DR*CD38*KI67* 
PD-1* activated, proliferating cells and were 
more prevalent in COVID-19 patients (Fig. 2, I 
and J, and fig. S2F). By contrast, the central 
Eomes*CD45RA CD27'CCR7 EM1-like clus- 
ter 6 and T-bet™CX3CRI* cluster 11 were de- 
creased in COVID-19 patients compared with 
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HDs (Fig. 2, land J, and fig. S2F). Thus, CD8 
T cell responses in COVID-19 patients were 
characterized by populations of activated, 
proliferating CD8 T cells in a subgroup of 
patients. 


SARS-CoV-2 infection is associated with 
heterogeneous CD4 T cell responses and 
activation of CD4 T cell subsets 


We next examined six well-defined CD4 T cell 
subsets as above for the CD8 T cells, includ- 
ing naive; EM1, -2, and -3; CM; and EMRA 
(Fig. 3A). Given the potential role of antibodies 
in the response to SARS-CoV-2 (27, 29), we 
also analyzed circulating T follicular helper 
(Tpy) cells [CD45RA PD-1*CXCR5* (cT py) (36)] 
and activated circulating Ty cells [CD38* 
ICOS* (activated cTyy)], the latter of which 
may be more reflective of recent antigen en- 
counter and emigration from the germinal 
center (37, 38) (Fig. 3A). These analyses re- 
vealed a relative loss of naive CD4 T cells com- 
pared with controls, but increased EM2 and 
EMRA (Fig. 3B). The frequency of activated but 
not total cTy;; cells was statistically increased 
in COVID-19 patients compared with HDs, 
though this effect appeared to be driven by a 
subgroup of patients (Fig. 3B). Notably, acti- 
vated cTpy frequencies were also higher in 
RDs than in HDs (Fig. 3B), perhaps reflect- 
ing residual COVID-19 responses in that group. 
Frequencies of KI67* or CD38*HLA-DR* non- 
naive CD4 T cells were increased in COVID-19 
patients (Fig. 3, C and E); however, this change 
was not equivalent across all CD4 T cell sub- 
sets. The most substantial increases in both 
KI67* and CD38*HLA-DR* cells were found in 
the effector memory populations (EM1, EM2, 
EM8) and in cTpy cells (fig. S3, A and B). 
Although some individuals had increased ac- 
tivation of EMRA, this response was less pro- 
nounced. By contrast, PD-1 expression was 
increased in all other non-naive populations 
compared with HDs or RDs (fig. S3C). Co- 
expression of CD38 and HLA-DR by non-naive 
CD4 T cells correlated with the frequency of 
KI67" non-naive CD4 T cells (fig. S3D). More- 
over, the frequency of total non-naive CD4 
T cells that were CD38*HLA-DR* correlated 
with the frequency of activated cTyy cells 
(fig. S3E). In general, the activation of CD4 
T cells was correlated with the activation 
of CD8 T cells (Fig. 3, D and F). However, 
whereas about two-thirds of COVID-19 pa- 
tients had KI67* non-naive CD4 or CD8 T cell 
frequencies above controls, about one-third 
had no increase in frequency of KI67* CD4 
or CD8 T cells above that observed in HDs 
(Fig. 3, D and F). Moreover, although most 
patients had similar proportions of activated 
CD4 and CD8 T cells, a subgroup of patients 
had disproportionate activation of CD4 T cells 
relative to CD8 T cells (Fig. 3, D and F). KI67* 
and CD38*HLA-DR* non-naive CD4 T cell 
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frequencies correlated with ferritin and with 
APACHE III score (fig. S3F), suggesting a 
relationship between CD4 T cell activation 
and disease severity. Immunosuppression did 
not affect CD4 T cell activation; however, 
early steroid administration was weakly as- 
sociated with CD4 T cell KI67 expression 
(fig. S3F). Together, these data indicate that 
T cell activation in COVID-19 patients is sim- 
ilar to what has been observed in other acute 
infections or vaccinations (37, 39, 40) and 
identify patients with high, low, or essen- 
tially no T cell response on the basis of KI67* 
or CD38*HLA-DR* expression compared with 
control individuals. 

Projecting the global CD4 T cell differenti- 
ation patterns into the high-dimensional tsNE 
space again identified major alterations in the 
CD4 T cell response in COVID-19 patients com- 
pared with HDs and RDs (Fig. 3G). In COVID- 
19 infection, there was a notable increase in 
density in tSNE regions that mapped to ex- 
pression of CD38, HLA-DR, PD1, CD39, KI67, 
and CD95 (Fig. 3G), similar to CD8 T cells. To 
gain more insight into these CD4 T cell changes, 
we again used a FlowSOM clustering approach 
(Fig. 3, H and I). This analysis identified an 
increase in clusters 13 and 14 (representing 
populations that express HLA-DR, CD38, PD1, 
KI67 and CD95) as well as cluster 15 (contain- 
ing Tbet*CX3CRI1" effector-like CD4 T cells) in 
COVID-19 patients compared with HDs and 
RDs (Fig. 3, I and J, and fig. S3G). By contrast, 
this clustering approach identified reduction 
in CXCR5* cTyy,like cells (clusters 2 and 3) in 
COVID-19 participants compared with HDs 
(Fig. 3, I and H). Collectively, the results of this 
multidimensional analysis reveal distinct pop- 
ulations of activated and proliferating CD4 
T cells that were enriched in COVID-19 patients. 

A key feature of COVID-19 is thought to be 
an inflammatory response that, at least in some 
patients, is linked to clinical disease manifes- 
tation (2, 4) and high levels of chemokines and 
cytokines, including IL-IRA, IL-6, IL-8, IL-10, 
and CXCLIO (11, 41). To investigate the poten- 
tial connection of inflammatory pathways to 
T cell responses, we performed 31-plex Luminex 
analysis on paired plasma and culture super- 
natants of wCD3- and wCD28-stimulated PBMCs 
from a subset of COVID-19 patients and HD 
controls. Owing to biosafety restrictions, we 
were able to study only eight COVID-19 pa- 
tient blood samples that were confirmed nega- 
tive for SARS-CoV-2 RNA by polymerase chain 
reaction (PCR) (fig. S4A). Half of these COVID- 
19 patients had plasma CXCL10 concentrations 
that were ~15 times as high as those of HD 
controls, whereas the remainder showed only 
a limited increase (fig. S4B). CKCL9, CCL2, and 
IL-1RA were also significantly increased. By con- 
trast, chemokines involved in the recruitment of 
eosinophils (eotaxin) or activated T cells (CCL5) 
were decreased. IL-6 was not elevated in this 
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Fig. 3. CD4 T cell activation in a subset of COVID-19 patients is associated 
with distinct CD4 T cell subsets. (A) Representative flow cytometry plots of 
the gating strategy for CD4 T cell subsets. (B) Frequencies of CD4 T cell 
subsets, as indicated. (C) Frequencies of KI67* cells. The green line in the left 
panel denotes the upper decile of HDs. Representative flow cytometry plots are 
shown at right. (D) KI67* cells from non-naive CD4 T cells versus non-naive 
CD8 T cells. Spearman correlation of COVID-19 patients is shown. (E) Frequencies 
of HLA-DR*CD38* cells. The green line in the left panel denotes the upper 

decile of HDs. Representative flow cytometry plots are shown at right. 

(F) HLA-DR*CD38* cells from non-naive CD4 versus non-naive CD8 T cells, 


group of patients, in contrast to the subset of 
individuals tested clinically (fig. S1B), poten- 
tially because IL-6 was measured in the hos- 
pital setting, often when systemic inflammation 
was suspected. After stimulation in vitro, 
PBMCs from COVID-19 patients produced 
more CCL2, CXCL10, eotaxin, and IL-IRA than 
those from HDs (fig. S4, C and D), and con- 
centrations of CXCL1O and CCL2 correlated 
between the matched supernatant from stimu- 
lated PBMCs and plasma samples (fig. S4E). 
Finally, we investigated whether CD8 T cells 
from COVID-19 patients were capable of pro- 
ducing interferon-y (IFNy) after polyclonal 
stimulation. After stimulation with aCD3 and 
aCD28, similar proportions of CD8 T cells 
from COVID-19 patients and HD controls 
produced IFNy, which suggests that PBMCs 
from COVID-19 patients were responsive to 
T cell receptor cross-linking (fig. S4, F to H). The 
ability of T cells to produce IFNy after stim- 
ulation occurred in patients with increases in 
K167 as well as patients with low KI67 (fig. S4, 
F to H). Taken together, these data support the 
notion that a subgroup of COVID-19 patients 
has elevated systemic cytokines and chemokines, 
including myeloid-recruiting chemokines. 


COVID-19 infection is associated with 
increased frequencies of PBs and proliferation 
of memory B cell subsets 


B cell subpopulations were also altered in 
people with COVID-19. Whereas naive B cell 
frequencies were similar in COVID-19 patients 
and RDs or HDs, the frequencies of class- 
switched (IgDCD27*) and not-class-switched 
(IgD*CD27*) memory B cells were significantly 
reduced (Fig. 4A). Conversely, frequencies of 
CD27 IgD" B cells and CD27°CD38* PBs were 
often markedly increased (Fig. 4, A and B). In 
some cases, PBs represented >30% of circulat- 
ing B cells, similar to levels observed in acute 
Ebola or dengue virus infections (42, 43). How- 
ever, these PB responses were observed in only 
about two-thirds of patients, with the remain- 
ing patients displaying PB frequencies similar 
to those in HDs and RDs (Fig. 4B). K167 ex- 
pression was markedly elevated in all B cell 
subpopulations in COVID-19 patients compared 
with either control group (Fig. 4C). This obser- 
vation suggests a role for an antigen-driven 
response to infection- and/or lymphopenia- 
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driven proliferation. Higher K167 levels in PBs 
may reflect recent generation in COVID-19 
patients relative to HDs or RDs. CXCR5 ex- 
pression was also reduced on all major B cell 
subsets in COVID-19 patients (Fig. 4D). Loss 
of CXCR5 was not specific to B cells, however, 
as expression was also decreased on non-naive 
CD4 T cells (Fig. 4E). Changes in the B cell 
subsets were not associated with coinfection, 
immune suppression, or treatment with steroids 
or other clinical features, though a possible neg- 
ative association of IL-6 and PBs was revealed 
(fig. S5A). These observations suggest that the 
B cell response phenotype of COVID-19 was 
not simply due to systemic inflammation. 
During acute viral infections or vaccination, 
PB responses are transiently detectable in the 
blood and correlate with cT;,; responses (40). 
Comparing the frequency of PBs to the fre- 
quency of total cT}; cells or activated cTry 
cells, however, revealed a weak correlation 
only with activated cTpy cells (Fig. 4F and 
fig. S5, B and C). Furthermore, some patients 
had robust activated cT;y responses but PB 
frequencies similar to those of controls, whereas 
other patients with robust PB responses had 
relatively low frequencies of activated cTry 
cells (Fig. 4F and fig. S5, B and C). There was 
also an association between PB frequency and 
CD38*HLA-DR* or KI67* CD4 T cells that 
might reflect a role for non-CXCR5* CD4 T cell 
help (fig. S5D), but such a relationship did not 
exist for the equivalent CD8 T cell populations 
(fig. SSE). Although ~70% of the COVID-19 pa- 
tients analyzed in our study made antibodies 
against SARS-CoV-2 spike protein [79 of 111 
IgG; 77 of 115 IgM (44)], antibody levels did not 
correlate with PB frequencies (Fig. 4G and fig. 
S5F). The occasional lack of antibody did not 
appear to be related to immunosuppression in 
a small number of patients (fig. S5G). The lack 
of PB correlation with antibody suggests that a 
proportion of these large PB responses were: 
(i) generated against SARS-CoV-2 antigens 
other than the spike protein or (ii) inflamma- 
tion driven and perhaps nonspecific or low 
affinity. Notably, anti-SARS-CoV-2 IgG and 
IgM levels correlated with the activated, but 
not total, cTyy response, which suggests that 
at least a proportion of cTyy cells were pro- 
viding SARS-CoV-2-specific help to B cells 
(Fig. 4, H and I, and fig. $5, H and J). Al- 
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Spearman correlation of COVID-19 patients is shown. (G) (Top) Global viSNE 
projection of non-naive CD4 T cells for all participants pooled, with non-naive 
CD4 T cells from HDs, RDs, and COVID-19 patients concatenated and overlaid. 
(Bottom) viSNE projections of indicated protein expression. (H) viSNE projection 
of non-naive CD4 T cell clusters identified by FlowSOM clustering. (I) MFI as 
indicated (column-scaled z-scores). (J) Percentage of non-naive CD4 T cells 
from each cohort in each FlowSOM cluster. Boxes represent IQRs. (B, C, E, and J) 
Each dot represents an individual HDs (green), RDs (blue), or COVID-19 patient 
(red). Significance was determined by unpaired Wilcoxon test with BH correction: 
*P < 0.05, **P < 0.01, ***P < 0.001, and ****P < 0.0001. 


though defining the precise specificity of the 
robust PB populations will require future 
studies, these data suggest that at least some 
of the PB response is specific for SARS-CoV-2. 

Projecting the flow cytometry data for B cells 
from HDs, RDs, and COVID-19 patients in 
tSNE space revealed a distinct picture of B cell 
populations in COVID-19 patients compared 
with controls, whereas populations in RDs 
and HDs were similar (Fig. 4J and fig. S5J). 
The COVID-19 patient B cell phenotype was 
dominated by the loss of CKCR5 and IgD com- 
pared with B cells from HDs and RDs (Fig. 4J). 
Moreover, the robust PB response was appar- 
ent in the upper right section, highlighted by 
CD27, CD38, CD138, and KI67 (Fig. 4J). The 
expression of KI67 and CD95 in these CD27" 
CD38*CD138" PBs (Fig. 4J) may suggest recent 
generation and/or emigration from germinal 
centers. We next asked whether there were 
different groups of COVID-19 patients (or HDs 
and RDs) with global differences in the B cell 
response. We used the Earth mover’s distance 
(EMD) metric (45) to calculate similarities 
between the probability distributions within 
the tSNE map (Fig. 4J) and clustered data so 
that individuals with the most-similar distri- 
butions grouped together (Fig. 4K). The ma- 
jority of COVID-19 patients fell into two distinct 
groups (EMD groups 1 and 3; Fig. 4L), sug- 
gesting two major immunotypes of the B cell 
response. The remainder of the COVID-19 pa- 
tients (~25%) clustered with the majority of 
the HD and all of the RD controls, supporting 
the observation that some individuals had 
limited evidence of response to infection in 
their B cell compartment. To identify the pop- 
ulation differences between HDs, RDs, and 
COVID-19 patients, we performed FlowSOM 
clustering on the tSNE map and overlaid each 
individual EMD group onto this same tSNE 
map (Fig. 4, M and N). EMD group 2, con- 
taining mostly HDs and RDs, was enriched for 
naive B cells (IgD*CD27,, cluster 10) and 
CXCR5*IgD°CD27* switched memory cells 
(cluster 2), and indeed, clusters 2 and 10 were 
statistically reduced in COVID-19 patients 
(Fig. 4P). EMD groups 1 and 3 displayed dis- 
tinct patterns across the FlowSOM clusters. 
B cells from individuals in EMD group 1 were 
enriched for FlowSOM clusters 1, 5, and 6, all 
of which were increased in COVID-19 patients 
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Fig. 4. Deep profiling of 


COVID-19 patient B cell populations reveals robust 


PB populations and other B cell alterations. (A) Gating strategy and 
frequencies of non-PB B cell subsets. (B) Representative flow cytometry plots 


and frequencies of PBs. 


he green line in the right panel denotes the upper 


decile of HDs. (C) Representative flow cytometry plots and frequencies of 
KI67* B cells. (D) (Left) Representative histograms of CXCR5 expression; (right) 


CXCR5 geometric MFI (G 


Fl) of B cell subsets. (E) CXCR5 GMFI of non-naive 


CD4 T cells and cTry cells. (F) Spearman correlation between PBs and activated 


cTry Cells. (@) Spearman 
(H and I) Spearman corre 


correlation between PBs and anti-SARS-CoV-2 IgG. 
ation between activated cTy cells and anti-SARS-CoV-2 


overlaid. (Bottom) viSNE projections of expression of the indicated proteins. 

(K) Hierarchical clustering of EMD using Pearson correlation, calculated pairwise 
for B cell populations for all participants (row-scaled z-scores). (L) Percentage 
of cohort in each EMD group. (M) Global viSNE projection of B cells for all 
participants pooled, with EMD groups 1 to 3 concatenated and overlaid. 

(N) B cell clusters identified by FlowSOM clustering. (0) MFI as indicated 
(column-scaled z-scores). (P) Percentage of B cells from each cohort in each 
FlowSOM cluster. Boxes represent IQRs. (A to F and P) Dots represent individual 
HDs (green), RDs (blue), or COVID-19 (red) participants. (A to E and P) 
Significance was determined by unpaired Wilcoxon test with BH correction: 


(H) IgM and (I) IgG. (J) (Top) Global viSNE projection of B cells for all 
participants pooled, with B cell populations of each cohort concatenated and 


(Fig. 4P). FlowSOM clusters 1 and 6 captured 
T-bet* memory B cells, whereas FlowSOM clus- 
ter 5 contained the CD27'CD38*CD138*KI67* 
PBs, all of which were enriched in COVID-19 
patients relative to controls (Fig. 4, O and P, 
and fig. S5K). By contrast, B cells from COVID- 
19 patients in EMD group 3 also showed en- 
richment for the PB FlowSOM cluster 5, though 
less prominent than for EMD group 1, but the 
T-bet* memory B cell cluster 1 was substan- 
tially reduced in EMD group 3. Thus, B cell 
responses—most often characterized by ele- 
vated PBs, decreases in memory B cell subsets, 
enrichment in a T-bet’ B cell subset, and loss 
of CXCR5 expression—were evident in many 
hospitalized COVID-19 patients. Whether all 
of these changes in the B cell compartment 
were due to direct antiviral responses is un- 
clear. Although there was heterogeneity in the 
B cell responses, COVID-19 patients fell into 
two distinct patterns containing activated B cell 
responses and a third group of patients with 
little evidence of an active B cell response. 


Temporal changes in immune cell populations 
occur during COVID-19 


A key question for hospitalized COVID-19 pa- 
tients is how immune responses change over 
time. Thus, we used the global tSNE projec- 
tions of overall CD8 T cell, CD4 T cell, and 
B cell differentiation states to investigate 
temporal changes in these populations be- 
tween DO and D7 of hospitalization (Fig. 5A). 
Combining data for all patients revealed con- 
siderable stability of the tSNE distributions 
between DO and D7 in CD8 T cell, CD4 T cell, 
and B cell populations, particularly for the 
key regions of interest discussed above. For 
example, for CD8 T cells, the region of the 
tSNE map containing KI67* and CD38*HLA- 
DR* CD8 T cell populations that was enriched 
in COVID-19 patients at DO (Fig. 2) was pre- 
served at D7 (Fig. 5A). A similar temporal sta- 
bility of CD4 T cell and B cell activation was 
also observed (Fig. 5A). 

Given this apparent stability between DO 
and D7, we next investigated temporal changes 
in lymphocyte subpopulations of interest. 
Although there were no obvious temporal 
changes in major phenotypically defined CD4 
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and CD8 T cell or B cell subsets, including PBs 
(Fig. 5D), the frequencies of HLA-DR*CD38* 
and KI67* non-naive CD4 (Fig. 5B) and KI67* 
non-naive CD8 T cells were statistically in- 
creased at D7 compared with DO (Fig. 5C). 
However, in all cases, these temporal pat- 
terns were complex, with frequencies of sub- 
populations in individual patients appearing 
to increase, decrease, or stay the same over 
time. To quantify these interpatient changes, 
we used a previously described dataset (46) to 
define the stability of populations of interest 
in healthy individuals over time. We then 
used the range of this variation over time to 
identify COVID-19 patients with changes in 
immune cell subpopulations beyond that ex- 
pected in healthy people (see Materials and 
methods section). With this approach, ~50% of 
patients had an increase in HLA-DR*CD38* 
non-naive CD4 T cells over time, whereas these 
cells were stable in ~30% of patients and de- 
creased in the remaining ~20% (Fig. 5E). For 
K1I67* non-naive CD8 T cells, there were no 
individuals in whom the response decreased. 
Instead, this proliferative CD8 T cell response 
stayed stable (~70%) or increased (~30%) (fig. 
S6A). Notably, for patients in the stable cat- 
egory, the median frequency of KI67* non- 
naive CD8 T cells was ~10%, almost 10 times 
as high as the ~1% detected for HDs and RDs 
(Figs. 5C and 2E), suggesting a sustained CD8 
T cell proliferative response to infection. A sim- 
ilar pattern was observed for HLA-DR*CD38* 
non-naive CD8 cells (fig. S6B): Only ~10% of 
patients had a decrease in this population, 
whereas ~65% were stable and ~25% had an 
increase over time. The high and even increas- 
ing activated or proliferating CD8 and CD4 
T cell responses over ~1 week during acute 
viral infection contrasted with the sharp peak 
of KI67 in CD8 and CD4 T cells during acute 
viral infections, including smallpox vaccina- 
tion with live vaccinia virus (47), live attenu- 
ated yellow fever vaccine YFV-17D (48), acute 
influenza virus infection (49), and acute HIV 
infection (35). Approximately 42% of patients 
had sustained PB responses, at high levels 
(>10% of B cells) in many cases (Fig. 5F). 
Thus, some patients displayed dynamic changes 
in T cell or B cell activation over 1 week in the 
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*P < 0.05, **P < 0.01, ***P < 0.001, and ****P < 0.0001. (G to |) The black 
horizontal line represents the positive threshold. 


hospital, but other patients remained stable. In 
the latter case, some patients remained stable 
without clear activation of key immune popula- 
tions, whereas others had stable T and/or B cell 
activation or numerical perturbation (fig. S6C). 

We next asked whether these T and B cell 
dynamics are related to clinical measures of 
COVID-19. To do this, we correlated changes 
in immune features from DO to D7 with clin- 
ical information (Fig. 5G). These analyses re- 
vealed distinctive correlations. Decreases in 
all populations of responding CD4 and CD8 
T cells (HLA-DR*CD38", KI67", and activated 
cTry) between DO and D7 were positively 
correlated with PMN and WBC counts, suggest- 
ing a relationship between T cell activation and 
lymphopenia. Furthermore, decreases in CD4 
and CD8 HLA-DR*CD38* T cells positively cor- 
related with APACHE III score. However, stable 
HLA-DR*CD38* CD4 T cell responses corre- 
lated with coagulation complications and fer- 
ritin levels. Whereas decreasing activated cT yy 
cells over time was related to coinfection, the 
opposite pattern was observed for PBs. In- 
creases in proliferating KI67* CD4 and CD8 
T cells over time were positively correlated to 
increasing anti-SARS-CoV-2 antibody from 
DO to D7, suggesting that some individuals 
might have been hospitalized during the ex- 
pansion phase of the antiviral immune re- 
sponse (Fig. 5G). Finally, neither remdesivir 
nor HCQ treatment correlated with any of 
these immune features (Fig. 5G). When we 
examined categorical rather than continuous 
clinical data, we found that 80% of patients 
with decreasing PBs over time had hyperlip- 
idemia, whereas only 20% of patients with 
increasing PBs over time had this comorbid- 
ity (fig. S6D). All patients who had decreasing 
CD38*HLA-DR* CDs T cells from DO to D7 
were treated with early vasoactive medication 
or inhaled nitric oxide, whereas these treat- 
ments were less common for patients with 
stable or increasing CD38*HLA-DR* CDs T cells 
(fig. S6E). By contrast, vasoactive medication, 
inhaled nitric oxide, and early steroid treat- 
ment were equally common in patients with 
increasing or decreasing PBs (fig. S6D). Sim- 
ilar patterns were apparent for other T cell 
populations and these categorical clinical data 
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Fig. 5. Temporal relationships between immune responses and disease 
manifestation. (A) Global viSNE projection of non-naive CD8 T cells, non-naive 
CD4 T cells, and B cells for all participants pooled, with cells from COVID-19 
patients at DO and D7 concatenated and overlaid. Frequencies of (B) KI67* and 
HLA-DR*CD38* CD4 T cells, (C) KI67* and HLA-DR*CD38* CD8 T cells, or 

(D) PBs as indicated for HDs (green), RDs (blue), or COVID-19 patients (red), 
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with paired samples at DO and D7 indicated by connecting lines. Significance 
was determined by paired Wilcoxon test: *P < 0.05, **P < 0.01, ***P < 0.001, 
and ****P < 0.0001. Longitudinal patterns (see Materials and methods) of 

(E) HLA-DR*CD38* CD4 T cells or (F) PBs in COVID-19 patients shown as 
frequency and representative flow cytometry plots. (@) Spearman correlations of 
clinical parameters with longitudinal fold changes in immune populations. 
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Fig. 6. High-dimensional analysis of immune phenotypes with clinical data 
reveals distinct COVID-19 patient immunotypes. (A) NIH ordinal scale for 
COVID-19 clinical severity. (B) Frequencies of major immune subsets. 
Significance was determined by unpaired Wilcoxon test with BH correction: 

*P < 0.05, **P < 0.01, ***P < 0.001, and ****P < 0.0001. (C) Heatmap of 
indicated immune parameters by row; donor type, disease severity, and 
mortality are indicated across the top. (D) UMAP projection of aggregated 
flow cytometry data. (E) Transformed UMAP projection. Density contours were 
drawn separately for HDs, RDs, and COVID-19 patients (see Materials and 
methods). (F) Bars represent mean of UMAP component 1. Dots represent 
individual participants; bars are color-coded by participant group and/or severity 
score. (G) Density contour plots indicating variation of specified immune 


(fig. S6F). Thus, the trajectory of change in the 
T and B cell response in COVID-19 patients 
was strongly connected to clinical metrics 
of disease. 


Identifying immunotypes and relationships 
between circulating B and T cell responses 
with disease severity in COVID-19 patients 


To further investigate the relationship between 
immune responses and COVID-19 trajectory, 
we stratified the COVID-19 patients (m = 125) 
into eight different categories, according to 
the NIH Ordinal Severity Scale, ranging from 
COVID 1 (death) and COVID 2 (requiring 
maximal clinical intervention) to COVID 8 (at 
home with no required care) (Fig. 6A). We then 
asked how changes in T and B cell populations 
defined above on DO were related to disease 
severity. More severe disease was associated 
with lower frequencies of CD8 and CD4 T cells, 
with a greater effect on CD8 T cells in less 
severe disease (Fig. 6B). Taking all patients 
together, there were no statistically significant 
changes in the major T cell and B cell subsets 
related to disease severity, though some trends 
were present (fig. S7, A to C). By contrast, HLA- 
DR*CD38* CD8 T cells as well as both KI67* 
and HLA-DR*CD38* CD4T cells were increased 
in patients with more severe disease (fig. S7, 
D and E). 

There were two challenges with extracting 
meaning from these data. First, there was 
considerable interpatient heterogeneity for 
each of these immune features related to dis- 
ease severity score. Second, these binary com- 
parisons (e.g., one immune subset versus one 
clinical feature) do not make full use of the 
high-dimensional information in this dataset. 
Thus, we next visualized major T and B cell 
subpopulation data as related to clinical dis- 
ease severity score (Fig. 6C). Data were clus- 
tered according to immune features and then 
overlaid with the disease severity score over 
time for each patient. This analysis revealed 
groups of patients with similar composite im- 
mune signatures of T and B cell populations 
(Fig. 6C). When individual CD8 T cell, CD4 
T cell, or B cell populations were examined, a 
similar concept of patient subgroups emerged 
(fig. S7, F, G, and H). These data suggested the 
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idea of immunotypes of COVID-19 patients on 
the basis of integrated responses of T and 
B cells, though some individual cell types 
and/or phenotypes separated patients more 
clearly than others. 

These approaches provided insight into po- 
tential immune phenotypes associated with 
patients with severe disease but were hin- 
dered by the small number of manually se- 
lected T or B cell subsets or phenotypes. We 
therefore next employed uniform manifold 
approximation and projection (UMAP) to dis- 
till the ~200 flow cytometry features (tables 
S5 and S6) representing the immune land- 
scape of COVID-19 in two-dimensional space, 
creating compact meta-features (or compo- 
nents) that could then be correlated with 
clinical outcomes. This analysis revealed a 
clear trajectory from HDs to COVID-19 pa- 
tients (Fig. 6D), which we centered and aligned 
with the horizontal axis (component 1) to 
facilitate downstream analysis (Fig. 6E). An 
orthogonal vertical axis coordinate (compo- 
nent 2) captured nonoverlapping aspects of 
the immune landscape. We next calculated 
the mean of component 1 for each patient 
group, with COVID-19 patients separated by 
severity score (Fig. 6E). The contribution of 
component 1 clearly increased in a stepwise 
manner with increasing disease severity 
(Fig. 6F). Notably, RDs were subtly positioned 
between HDs and COVID-19 patients. Com- 
ponent 1 remained an independent predictor 
of disease severity (P = 5.5 x 10°”) even after 
adjusting for the confounding demographic 
factors of age, sex, and race. 

We next investigated how the UMAP com- 
ponents were associated with individual im- 
mune features (tables S5 and S6). UMAP 
component 1 captured immune features, in- 
cluding the relative loss of CD4 and CD8 T cells 
and increase in the ratio of non-B and non-T 
cells to T and B cells (Fig. 6G). PBs also as- 
sociated with component 1 (Fig. 6G). Other 
individual B cell features were differential- 
ly captured by UMAP components 1 and 2. 
Component 1 contained a signal for T-bet* PB 
populations (table $5), whereas component 2 
was enriched for T-bet* memory B cells and 
CD138* PB populations (table S6). Activated 
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features across UMAP component coordinates. Relative expression (according to 
heat scale) is shown for both individual patients (points) and overall density 
(contours). Spearman's rank correlation coefficient (p) and P value for each 
nt 1 (C1) and component 2 (C2) are shown. (H) (Left) 
Spearman correlation between UMAP components 1 and 2 and FlowSOM 
clusters. (Right) Select FlowSOM clusters and their protein expression. 

(1) Spearman correlation between UMAP components 1 and 2 and clinical 
metadata. (J) Heatmap of immune parameters used to define immunotype 3 
e severity and mortality are indicated across the top. 
UMAP projection; patient status for immunotype 3 
indicated by color. (Right) Spearman correlation between immunotype 3 and 
ty, and UMAP components. 


HLA-DR*CD38* and KI67* CD4: and CD8 T cells 
had contributions to both components, with 
these features residing in the upper right cor- 
ner of the UMAP plot (Fig. 6, G and H, and fig. 
S8, A to D). By contrast, T-bet* non-naive CD8 
T cells were strongly associated with compo- 
nent 2, whereas T-bet* non-naive CD4 T cells 
were linked to component 1 (Fig. 6G and tables 
S5 and S6). Eomes* CD8 or CD4:T cells were 
both associated with component 2 and nega- 
tively associated with component 1 (Fig. 6G 
and tables S5 and S6). 

We next took advantage of the FlowSOM 
clustering in Figs. 2 to 4 that identified in- 
dividual immune cell types most perturbed in 
COVID-19 patients and linked these FlowSOM 
clusters to UMAP components (Fig. 6H). For 
non-naive CD8 T cells, FlowSOM cluster 11, 
which contained T-bet*CX3CRI1* but non- 
proliferating effector-like cells, was positively 
correlated with UMAP component 2 and neg- 
atively correlated with component 1 (Fig. 6H). 
By contrast, FlowSOM cluster 14, which con- 
tained activated, proliferating PD-1*CD39* 
cells that might reflect either recently gen- 
erated effector or exhausted CD8 T cells (50), 
was strongly associated with UMAP compo- 
nent 1 (Fig. 6H). For CD4 T cells, FlowSOM 
cluster 14, containing activated, proliferating 
CD4: T cells, was captured by both UMAP com- 
ponents, whereas a second activated CD4 T cell 
population that also expressed CD95 (FlowSOM 
cluster 13) was captured by only UMAP com- 
ponent 1 (Fig. 6H). In addition, component 
1 was negatively correlated with CD4 T cell 
FlowSOM clusters 2 and 3 that contained 
cTyy cells (Fig. 6H). Finally, for B cells, the 
FlowSOM cluster of T-bet*CD138* PBs (clus- 
ter 5) was positively correlated with com- 
ponent 1, whereas the T-bet"CD138" cluster 3 
was negatively correlated with this same com- 
ponent (Fig. 6H). Locations in the UMAP im- 
mune landscape were dynamic, changing from 
DO to D7 for both components, consistent with 
the data in Fig. 5 and fig. S9, A to F. The most 
dynamic changes in component 1 were asso- 
ciated with the largest increases in IgM anti- 
body levels (fig. S9G). 

Given the association of UMAP component 
1 with disease severity, we next examined the 
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connections between UMAP components and 
individual clinical features. UMAP compo- 
nent 1 correlated with several clinical mea- 
surements of inflammation (e.g., ferritin, hsCRP, 
IL-6), coinfection, organ failure (APACHE IID, 
and acute kidney disease and renal insuffi- 
ciency (Fig. 61). However, although D-dimer 
level was elevated, this feature did not corre- 
late with UMAP component 1, whereas co- 
agulation complication did (Fig. 61). Several 
antibody features also correlated with com- 
ponent 1, consistent with some of the immune 
features discussed above. By contrast, compo- 
nent 2 lacked positive correlation to many 
of these clinical features of disease and was 
negatively correlated to eosinophil count, non- 
steroidal anti-inflammatory drug (NSAID) use, 
and subsequent treatment with remdesivir 
(Fig. 61). UMAP component 1, not compo- 
nent 2, also correlated with mortality, although 
there were clearly patients with high compo- 
nent 2 but low component 1 who died from 
COVID-19 (Fig. 6E). These data indicate that 
the immune features captured by UMAP com- 
ponent 1 have a strong relationship to many 
features of disease severity, whereas other 
features of immune dynamics during COVID-19 
captured by UMAP component 2 have a distinct 
relationship with clinical disease presentation. 
More-positive values in UMAP components 
1 or 2 captured mainly signals of change or 
differences in individual immune features in 
COVID-19 patients compared with HDs and 
RDs. UMAP component 1 captured an immu- 
notype (immunotype 1) characterized by effec- 
tor or highly activated CD4 T cells, low cTry 
cells, some CD8 EMRA-like activation, possi- 
bly hyperactivated CD8 T cells, and Tbet* PBs, 
whereas component 2 (immunotype 2) cap- 
tured Tbet"£"" effector-like CD8 T cells, had 
less robust CD4 T cell activation, and had some 
features of proliferating B cells (Fig. 6G and 
fig. S8). However, the data presented in Figs. 
1 to 5 also suggested a subset of patients with 
minimal activation of T and B cell responses. 
To investigate this immune signature, we 
identified 20 patients who had responses 
more similar to those of HDs and RDs for 
five activated or responding B and T cell pop- 
ulations (Fig. 6J, middle, and fig. S10). If 
UMAP components 1 and 2 captured two dis- 
tinct immunotypes of patient responses to 
SARS-CoV-2 infection, this group of 20 pa- 
tients represents a third immunotype. Immuno- 
type 3 was negatively associated with UMAP 
components 1 and 2 and negatively associated 
with disease severity, which suggests that a less 
robust immune response during COVID-19 was 
associated with less severe pathology (Fig. 6K 
and fig. S10), despite the fact that these patients 
were hospitalized with COVID-19. These data 
further emphasize the different ways in which 
patients can present with and possibly die from 
COVID-19. These patterns may be related to 


Mathew et al., Science 369, eabc8511 (2020) 


preexisting conditions in combination with im- 
mune response characteristics. It is likely that 
additional immune features, such as compre- 
hensive serum cytokine measurements, will 
improve this model. Nevertheless, the current 
computational approach integrating deep im- 
mune profiling with disease severity trajec- 
tory and other clinical information revealed 
distinct patient immunotypes linked to dis- 
tinct clinical outcomes (fig. S11). 


Discussion 


The T and B cell response to SARS-CoV-2 in- 
fection remains poorly understood. Some 
studies suggest that an overaggressive immune 
response leads to immunopathology (51), 
whereas others suggest that the mechanism 
is T cell exhaustion or dysfunction (12-14). 
Autopsies revealed high virus levels in the 
respiratory tract and other tissues (52), 
suggesting ineffective immune responses. 
Nevertheless, nonhospitalized individuals 
who recovered from COVID-19 had evidence 
of virus-specific T cell memory (53). SARS- 
CoV-2-specific antibodies are also found in 
convalescent individuals, and patients are 
currently being treated with convalescent 
plasma therapy (30, 54). However, COVID- 
19 patients in intensive care units (ICUs) 
have SARS-CoV-2-specific antibodies (30), 
raising the question of why patients with 
these antibody responses are not controlling 
disease. In general, these previous studies 
have reported on single patients or small 
cohorts and thus do not achieve compre- 
hensive deep immune profiling of larger 
numbers of hospitalized COVID-19 patients. 
Such knowledge would address the critical 
question of whether there is a common pro- 
file of immune dysfunction in critically ill 
patients. Such data would also help guide 
testing of therapeutics to enhance, inhibit, 
or otherwise tailor the immune response 
in COVID-19 patients. 

To elucidate the immune response patterns 
of hospitalized patients with COVID-19, we 
studied a cohort of ~125 patients. We used 
high-dimensional flow cytometry to perform 
deep immune profiling of individual B and 
T cell populations, with temporal analysis 
of immune changes during infection, and 
combined this profiling with extensive clin- 
ical data to understand the relationships be- 
tween immune responses to SARS-CoV-2 and 
disease severity. This approach led us to sev- 
eral key findings. First, a defining feature 
of COVID-19 in hospitalized patients is het- 
erogeneity of the immune response. Many 
COVID-19 patients displayed robust CD8 
T cell and/or CD4 T cell activation and pro- 
liferation and PB responses, though a substan- 
tial subgroup of patients (~20%) had minimal 
detectable responses compared with controls. 
Furthermore, even within those patients who 
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mounted detectable B and T cell responses 
during COVID-19, the immune characteristics 
of the responses were heterogeneous. With the 
use of deep immune profiling, we identified 
three immunotypes in hospitalized COVID-19 
patients: (i) robust activation and prolifera- 
tion of CD4 T cells, relative lack of CT cells, 
modest activation of EMRA-like cells, highly 
activated or exhausted CD8 T cells, and a 
signature of T-bet* PBs (immunotype 1); (ii) 
Thet?"2" effector-like CD8 T cell responses, 
less robust CD4 T cell responses, and Ki67* 
PBs and memory B cells (immunotype 2); and 
Gii) an immunotype largely lacking detectable 
lymphocyte response to infection, which sug- 
gests a failure of immune activation (immuno- 
type 3). UMAP embedding further resolved 
the T cell-activation immunotype, suggesting 
a link between CD4: T cell activation, immuno- 
type 1, and increased severity score. Although 
differences in age and race existed between 
the cohorts and could affect some immune 
variables, the major UMAP relationships were 
preserved even after correcting for these var- 
iables. Thus, these immunotypes may reflect 
fundamental differences in the ways in which 
patients respond to SARS-CoV-2 infection. 

Asecond key observation from these studies 
was the robust PB response. Some patients 
had PB frequencies rivaling those found in 
acute Ebola or dengue infection (34, 42, 43, 55). 
Furthermore, blood PB frequencies are typi- 
cally correlated with blood-activated cTyy re- 
sponses (40). However, in COVID-19 patients, 
this relationship between PBs and activated 
cTry cells was weak. The lack of relationship 
between these two cell types in this disease 
could be due to T cell-independent B cell re- 
sponses, lack of activated cTyy cells in periph- 
eral blood at the time point analyzed, or lower 
CXCR5 expression observed across lympho- 
cyte populations, making it more difficult to 
identify cTpy cells. Activated (CD38*HLA- 
DR*) CD4T cells could play a role in providing 
B cell help, perhaps as part of an extrafollicular 
response, but such a connection was not 
robust in the current data. Most ICU patients 
made SARS-CoV-2-specific antibodies, sug- 
gesting that at least part of the PB response 
was antigen specific. Indeed, the cTyy re- 
sponse did correlate with antibodies, which 
indicates that at least some of the humoral 
response is targeted against the virus. Future 
studies will be needed to address the antigen 
specificity, ontogeny, and role in pathogenesis 
for these robust PB responses. 

A notable feature of some patients with 
strong T and B cell activation and proliferation 
was the durability of the PB response. This 
T and B cell activation was interesting con- 
sidering the clinical lymphopenia in many 
patients. However, this lymphopenia was pref- 
erential for CD8 T cells. It may be notable that 
such focal lymphopenia preferentially affecting 
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CD8 T cells is also a feature of acute Ebola in- 
fection of macaques and is associated with CD95 
expression and severe disease (55). Indeed, 
CD95 was associated with activated T cell clus- 
ters in COVID-19. Nevertheless, the frequency 
of the KI67* or CD38*HLA-DR* CD8 and CD4 
T cell responses in COVID-19 patients was sim- 
ilar in magnitude to those of other acute viral 
infections or live attenuated vaccines in hu- 
mans (47-49). However, during many acute 
viral infections, the period for peak CD8 or CD4: 
T cell responses and the window for PB detec- 
tion in peripheral blood are relatively short 
(43, 56, 57). The stability of CD8 and CD4 T cell 
activation and PB responses during COVID-19 
suggests a prolonged period of peak immune 
responses at the time of hospitalization or per- 
haps a failure to appropriately down-regulate 
responses in some patients. These ideas would 
fit with an overaggressive immune response 
and/or “cytokine storm” (2) in this subset of 
patients. Indeed, in some patients, we found 
elevated serum cytokines and that stimulation 
of T cells in vitro provoked cytokines and che- 
mokines capable of activating and recruiting 
myeloid cells. A key question will be how to 
identify these patients for selected immune- 
regulatory treatment while avoiding treating pa- 
tients with already weak T and B cell responses. 

An additional major finding was the ability 
to connect immune features to disease severity 
at the time of sampling as well as to the tra- 
jectory of disease severity change over time. 
Using correlative analyses, we observed rela- 
tionships between features of the different 
immunotypes, patient comorbidities, and clin- 
ical features of COVID-19. By integrating ~200 
immune features with extensive clinical data, 
disease severity scores, and temporal changes, 
we built an integrated computational model 
that connected patient immune response 
phenotype to disease severity. This UMAP 
embedding approach allowed us to connect 
these integrated immune signatures to specific 
clinically measurable features of disease. The 
integrated immune signatures captured by 
components 1 and 2 in this UMAP model pro- 
vided support for the concept of immunotypes 
1 and 2. These analyses suggested that im- 
munotype 1—composed of robust CD4 T cell 
activation, paucity of CTpy cells with prolifer- 
ating effector or exhausted CD8 T cells, and 
T-bet* PB involvement—was connected to 
more-severe disease, whereas immunotype 
2—characterized by more traditional effector 
CD8 T cells subsets, less CD4 T cell activation, 
and proliferating PBs and memory B cells— 
was better captured by UMAP component 2. 
Immunotype 3, in which minimal lympho- 
cyte activation response was observed, may 
represent ~20% of COVID-19 patients and is a 
potentially important scenario to consider for 
patients who may have failed to mount a 
robust antiviral T and B cell response. This 
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UMAP integrated modeling approach could 
be improved in the future with additional 
data on other immune cell types and/or com- 
prehensive data for circulating inflammatory 
mediators for all patients. Nevertheless, these 
findings provoke the idea of tailoring clinical 
treatments or future immune-based clinical 
trials to patients whose immunotype suggests 
greater potential benefit. 

Respiratory viral infections can cause pa- 
thology as a result of an immune response that 
is too weak, resulting in virus-induced pathol- 
ogy, or too strong, leading to immunopathology 
(58). Our data suggest that the immune re- 
sponse of hospitalized COVID-19 patients may 
fall across this spectrum of immune response 
patterns, presenting as distinct immunotypes 
linked to clinical features, disease severity, and 
temporal changes in response and patho- 
genesis. This study provides a compendium 
of immune response data and an integrated 
framework to connect immune features to 
disease. By localizing patients on an immune 
topology map built on this dataset, we can begin 
to infer which types of therapeutic interventions 
may be most useful in specific patients. 


Materials and methods 
Patients, participants, and clinical data 
collection 


Patients admitted to the Hospital of the 
University of Pennsylvania with a positive 
SARS-CoV-2 PCR test were screened and ap- 
proached for informed consent within 3 days 
of hospitalization. Healthy donors (HDs) were 
adults with no prior diagnosis of or recent 
symptoms consistent with COVID-19. Normal 
reference ranges for HDs were the University 
of Pennsylvania clinical laboratory values 
shaded in green in Fig. 1B. Recovered donors 
(RDs) were adults with a prior positive COVID- 
19 PCR test by self-report who met the defi- 
nition of recovery by the Centers for Disease 
Control and Prevention. HDs and RDs were 
recruited initially by word of mouth and sub- 
sequently through a centralized University of 
Pennsylvania resource website for COVID-19- 
related studies. Peripheral blood was collected 
from all participants. For inpatients, clinical 
data were abstracted from the electronic med- 
ical record into standardized case report 
forms. ARDS was categorized in accordance 
with the Berlin Definition, reflecting each in- 
dividual’s worst oxygenation level and with 
physician adjudication of chest radiographs. 
APACHE III scoring was based on data col- 
lected in the first 24 hours of ICU admission 
or the first 24 hours of hospital admission 
for participants admitted to general inpatient 
units. Clinical laboratory data were abstracted 
from the date closest to that of research blood 
collection. HDs and RDs completed a survey 
about symptoms. After enrollment, the clinical 
team determined three patients to be COVID- 
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negative and/or PCR false-positive. Two of 
these patients were classified as immuno- 
type 3. In keeping with inclusion criteria, 
these individuals were maintained in the 
analysis. The statistical significance reported 
in Fig. 6K did not change when analysis was 
repeated without these three patients. All 
participants or their surrogates provided in- 
formed consent in accordance with protocols 
approved by the regional ethical research 
boards and the Declaration of Helsinki. 


Sample processing 


Peripheral blood was collected into sodium 
heparin tubes (BD, catalog no. 367874). Tubes 
were spun [15 min, 3000 rpm, room temper- 
ature (RT)], and plasma was removed and 
banked. Remaining whole blood was diluted 
1:1 with 1% RPMI (table S7) and layered into 
a SEPMATE tube (STEMCELL Technologies, 
catalog no. 85450) preloaded with lymphoprep 
(STEMCELL Technologies, catalog no. 1114547). 
SEPMATE tubes were spun (10 min, 1200xg, RT), 
and the PBMC layer was collected, washed with 
1% RPMI (10 min, 1600 rpm, RT), and treated 
with ACK lysis buffer (5 min, ThermoFisher, 
catalog no. A1049201). Samples were filtered 
with a 70-um filter, counted, and aliquoted for 
staining. 


Antibody panels and staining 


Approximately 1 x 10° to 5 x 10° freshly iso- 
lated PBMCs were used per patient per stain. 
See table S7 for buffer information and table 
S8 for antibody panel information. PBMCs 
were stained with live/dead mix (100 ul, 10 min, 
RT), washed with fluorescence-activated cell 
sorting (FACS) buffer, and spun down (1500 rpm, 
5 min, RT). PBMCs were incubated with 100 pl 
of Fc block (RT, 10 min) before a second wash 
(FACS buffer, 1500 rpm, 5 min, RT). Pellet was 
resuspended in 25 ul of chemokine receptor 
staining mix and incubated at 37°C for 20 min. 
After incubation, 25 11 of surface receptor stain- 
ing mix was directly added, and the PBMCs 
were incubated at RT for a further 45 min. 
PBMCs were washed (FACS buffer, 1500 rpm, 
5 min, RT) and stained with 50 ul of sec- 
ondary antibody mix for 20 min at RT and 
then washed again (FACS buffer, 1500 rpm, 
5 min, RT). Samples were fixed and permea- 
bilized by incubating in 100 ul of Fix/Perm 
buffer (RT, 30 min) and washing in Perm 
Buffer (1800 rpm, 5 min, RT). PBMCs were 
stained with 50 ul of intracellular mix over- 
night at 4°C. The following morning, samples 
were washed (Perm Buffer, 1800 rpm, 5 min, 
RT) and further fixed in 50 ul of 4% para- 
formaldehyde (PFA). Before acquisition, sam- 
ples were diluted to 1% PFA, and 10,000 counting 
beads were added per sample (BD, catalog 
no. 335925). Live/dead mix was prepared in 
phosphate-buffered saline (PBS). For the sur- 
face receptor and chemokine staining mix, 
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antibodies were diluted in FACS buffer with 
50% BD Brilliant Buffer (BD, catalog no. 566349). 
Intracellular mix was diluted in Perm Buffer. 


Flow cytometry 


Samples were acquired on a five-laser BD FACS 
Symphony A5. Standardized SPHERO rainbow 
beads (Spherotech, catalog no. RFP-30-5A) were 
used to track and adjust photomultiplier tubes 
over time. UltraComp eBeads (ThermoFisher, 
catalog no. 01-2222-42) were used for compen- 
sation. Up to 2 x 10° live PBMCs were acquired 
per sample. 


Luminex 


PBMCs from patients were thawed and rested 
overnight at 37°C in complete RPMI (table S7). 
Flat-bottom plates with 96 wells were coated 
with 1 ug/ml of anti-CD3 (UCHTI, no. BE0231, 
BioXell) in PBS at 4°C overnight. The next day, 
cells were collected and plated at 1 x 10° per 
well in 100 ul in duplicate. Anti-human CD28/ 
CD49d (2 ug/ml) was added to the wells con- 
taining plate-bound anti-CD3 (Clone L293, 
347690, BD). PBMCs were stimulated or left 
unstimulated for 16 hours and spun down 
(1200 rpm, 10 min), and supernatant (85 ul 
per well) was collected. Plasma from matched 
individuals was thawed on ice and spun 
(3000 rpm, 1 min) to remove debris, and 85 yl 
were collected in duplicate. Luminex assay 
was run according to manufacturer’s in- 
structions, using a custom human cytokine 
31-plex panel (EMD Millipore Corporation, 
SPRCUS707). The panel included EGF, FGF-2, 
eotaxin, sIL-2Ra, G-CSF, GM-CSF, IFN- 
a2, IFN-y, IL-10, IL-12P40, IL-12P70, IL-13, IL- 
15, IL-17A, IL-1Ra, HGF, IL-18, CXCL9/MIG, 
IL-2, IL-4, IL-5, IL-6, IL-7, CKCL8/IL-8, CXCL10/ 
IP-10, CCL2/MCP-1, CCL3/MIP-la, CCLA/MIP-1B, 
RANTES, TNF-a, and VEGF. Assay plates were 
measured using a Luminex FlexMAP 3D instru- 
ment (ThermoFisher, catalog no. APX1342). 

Data acquisition and analysis were performed 
using xPONENT software (www.luminexcorp. 
com/xponent/). Data quality was examined on 
the basis of the following criteria: The standard 
curve for each analyte has a five-parameter R” 
value > 0.95 with or without minor fitting using 
xPONENT software. To pass assay technical 
quality control, the results for two controls in 
the kit needed to be within the 95% confidence 
interval provided by the vendor for >25 of the 
tested analytes. No further tests were done on 
samples with results categorized as out-of- 
range low (<OOR). Samples with results that 
were out-of-range high (>OOR) or greater than 
the standard curve maximum value (SC max) 
were not tested at higher dilutions without 
further request. 


Intracellular stain after CD3/CD28 stimulation 


Flat-bottom plates (96 wells) were coated with 
1 ug/ml of anti-CD3 (UCHTI1, no. BE0231, 
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BioXell) in PBS at 4°C overnight. The next 
day, cells were collected and plated at 1 x 10° 
per well in 100 ul with 1/1000 of GolgiPlug 
(BD, no. 555029). Anti-human CD28/CD49d 
(2 ug/ml) was added to the wells containing 
plate-bound anti-CD3 (Clone L293, 347690, 
BD). GolgiPlug-treated PBMCs were stimu- 
lated or left unstimulated for 16 hours, spun 
down (1200 rpm, 10 min), and stained for 
intracellular IFNy. 


Longitudinal analysis DO to D7 and patient 
grouping 

To identify participants in which the frequency 
of specific immune cell populations increased, 
decreased, or stayed stable over time (DO to 
D7), we used a previously published dataset 
(where data were available) to establish a 
standard range of fold change over time in 
a healthy cohort (44). A fold change greater 
than the mean fold change + 2 standard 
deviations was considered an increase, less 
than this range was considered a decrease, 
and within this range was considered stable. 
Where these data were not available, a fold 
change from DO to D7 of between 0.5 and 1.5 
was considered stable. A fold change <0.5 was 
considered a decrease, and >1.5 was considered 
an increase. To eliminate redundant tests and 
maximize statistical power, the pairwise statis- 
tical tests shown in Fig. 5G were performed 
using fold change as a continuous metric, ir- 
respective of the discrete up, stable, or down 
classification described above. Similarly, as shown 
in fig. S9G, pairwise association tests between 
changes in UMAP component coordinates and 
clinical data were performed using each differ- 
ence value as a continuous metric, irrespective 
of the up, stable, or down classification. 


Correlation plots and heatmap visualization 


Pairwise correlations between variables were 
calculated and visualized as a correlogram 
using R function corrplot. Spearman’s rank 
correlation coefficient (p) was indicated by 
square size and heat scale; significance was 
indicated by *P < 0.05, **P < 0.01, and ***P < 
0.001; and a black box indicates a false- 
discovery rate (FDR) < 0.05. Heatmaps were 
created to visualize variable values using R 
function pheatmap or complexheatmap. 


Statistics 


Owing to the heterogeneity of clinical and flow 
cytometric data, nonparametric tests of asso- 
ciation were preferentially used throughout 
this study unless otherwise specified. Correla- 
tion coefficients between ordered features 
(including discrete ordinal, continuous scale, 
or a mixture of the two) were quantified by the 
Spearman rank correlation coefficient, and sig- 
nificance was assessed by the corresponding 
nonparametric methods (null hypothesis: p = 
0). Tests of association between mixed contin- 
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uous versus nonordered categorical variables 
were performed by unpaired Wilcoxon test (for 
n = 2 categories) or Kruskal-Wallis test (for 2 > 
2 categories). Association between categorical 
variables was assessed by Fisher’s exact test. 
For association testing illustrated in heatmaps, 
categorical variables with more than two cat- 
egories (e.g., ABO blood type) were trans- 
formed into binary “dummy” variables for 
each category versus the rest. All tests were 
performed in a two-sided manner, using a 
nominal significance threshold of P < 0.05 
unless otherwise specified. When appropri- 
ate to adjust for multiple hypothesis testing, 
FDR correction was performed using the 
Benjamini-Hochberg procedure at the FDR < 
0.05 significance threshold. Joint statistical 
modeling to adjust for confounding of demo- 
graphic factors (age, sex, and race) when testing 
for association of UMAP components 1 and 2 
with the NIH Ordinal Severity Scale was per- 
formed using ordinal logistic regression pro- 
vided by the polr function of the R package 
MASS. Statistical analysis of flow cytometry data 
was performed using the R package rstativ. 
Other details, if any, for each experiment are 
provided within the relevant figure legends. 


High-dimensional data analysis of flow 
cytometry data 


viSNE and FlowSOM analyses were performed 
on Cytobank (https://cytobank.org). B cells, 
non-naive CD4 T cells, and non-naive CD8 
T cells were analyzed separately. viSNE analy- 
sis was performed using equal sampling of 
1000 cells from each FCS file, with 5000 itera- 
tions, a perplexity of 30, and a theta of 0.5. 
For B cells, the following markers were used 
to generate the viSNE maps: CD45RA, IgD, 
CXCR5, CD138, Eomes, TCF-1, CD38, CD95, CCR7, 
CD21, K167, CD27, CX3CR1, CD39, T-bet, HLA- 
DR, CD16, CD19 and CD20. For non-naive 
CD4 and CD8 T cells, the following markers 
were used: CD45RA, PD-1, CXCR5, TCF-1, CD38, 
CD95, Eomes, CCR7, K167, CD16, CD27, CX3CRI, 
CD39, CD20, T-bet, and HLA-DR. Resulting 
viSNE maps were fed into the FlowSOM 
clustering algorithm (59). For each cell subset, 
a new self-organizing map (SOM) was gen- 
erated using hierarchical consensus clustering 
on the tSNE axes. For each SOM, 225 clusters 
and 10 or 15 metaclusters were identified for 
B cells and T cells, respectively. 

To group individuals on the basis of B cell 
landscape, pairwise EMD values were calcu- 
lated on the B cell tSNE axes for all COVID-19 
DO patients, HDs, and RDs using the emdist 
package in R, as previously described (60). 
Resulting scores were hierarchically clustered 
using the Aclust package in R. 


Batch correction 


During the sample-acquisition period, the flow 
panel was changed to remove one antibody. 
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Batch correction was performed for samples 
acquired before and after this change to 
remove potential bias from downstream anal- 
ysis. Because the primary flow features were 
expressed as a fraction of the parent popula- 
tion (falling in the 0-to-1 interval), a variance 
stabilizing transform (logit) was first applied 
to each data value prior to recentering the 
second panel to have the same mean as the 
first. After mean-centering, data were trans- 
formed back to the original fraction of parent 
scale by inverse transform. This procedure 
was applied separately to all 553 flow features 
annotated in the main text and supplemental 
data. Notably, this procedure avoids any batch- 
corrected feature values artificially falling out- 
side of the original 0-to-1 range. After batch 
correction, neither UMAP component 1 nor 
component 2 had a statistically significant 
difference between panels by unpaired Wil- 
coxon test. 


Visualizing variation of flow cytometric features 
across the UMAP embedding space 


A feature-weighted kernel density was com- 
puted across all COVID-19 patients and was 
displayed as a contour plot (Fig. 6G and fig. S8, 
A to D). Whereas traditional kernel density 
methods apply the same base kernel function 
to every point to visualize point density, in this 
case the base kernel function centered at each 
individual COVID-19 patient sample was in- 
stead weighted (multiplied) by the Z-transform 
(mean-centered and standard deviation- 
scaled) of the log-transformed input feature 
prior to computing the overall kernel density. 
This weighting procedure facilitated visual- 
ization of the overall feature gradients (from 
relatively low to high expression) across UMAP 
coordinates. independent of the different range 
of each input feature. A radially symmetric two- 
dimensional Gaussian was used as the base 
kernel function with a variance parameter of 
one-half, which was tuned to be sufficiently 
broad in order to smooth out local disconti- 
nuities and best visualize feature gradients. 


Definition of immunotype 3 


To define COVID-19 patients with low or ab- 
sent immune responses, classified as immuno- 
type 3, the intersection of the bottom 50% 
of five different flow parameters was used: PB 
as percentage of B cells, KI67* as percentage 
of non-naive CD4 T cells, KI67* as percentage 
of non-naive CD8 T cells, HLA-DR*CD38* 
as percentage of non-naive CD4 T cells, and 
HLA-DR*CD38" as percentage of non-naive 
CD8 T cells. See fig. S10. 
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Coronavirus disease 2019 (COVID-19) represents a global crisis, yet major knowledge gaps remain about 
human immunity to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). We analyzed 
immune responses in 76 COVID-19 patients and 69 healthy individuals from Hong Kong and Atlanta, 
Georgia, United States. In the peripheral blood mononuclear cells (PBMCs) of COVID-19 patients, we 
observed reduced expression of human leukocyte antigen class DR (HLA-DR) and proinflammatory 
cytokines by myeloid cells as well as impaired mammalian target of rapamycin (mTOR) signaling and 
interferon-o (IFN-@) production by plasmacytoid dendritic cells. By contrast, we detected enhanced 
plasma levels of inflammatory mediators—including EN-RAGE, TNFSF14, and oncostatin M—which 
correlated with disease severity and increased bacterial products in plasma. Single-cell transcriptomics 
revealed a lack of type | IFNs, reduced HLA-DR in the myeloid cells of patients with severe COVID-19, 
and transient expression of IFN-stimulated genes. This was consistent with bulk PBMC transcriptomics and 
transient, low IFN-o levels in plasma during infection. These results reveal mechanisms and potential 


therapeutic targets for COVID-19. 


he recent emergence of the severe acute 

respiratory syndrome coronavirus 2 (SARS- 

CoV-2) in Wuhan, China, in December 2019 

and its rapid international spread caused 

a global pandemic. Research has moved 
rapidly in isolating, sequencing, and cloning 
the virus; developing diagnostic kits; and test- 
ing candidate vaccines. However, key ques- 
tions remain about the dynamic interaction 
between the human immune system and the 
SARS-CoV-2 virus. 
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Coronavirus disease 2019 (COVID-19) presents 
with a spectrum of clinical phenotypes, with 
most patients exhibiting mild to moderate 
symptoms and 15% of patients progressing, 
typically within a week, to severe or critical dis- 
ease that requires hospitalization (J). A mi- 
nority of those who are hospitalized develop 
acute respiratory disease syndrome (ARDS) 
and require mechanical ventilation. Epidemi- 
ological data so far suggest that COVID-19 has 
acase fatality rate several times greater than 
that of seasonal influenza (/). The elderly and 
individuals with underlying medical comor- 
bidities such as cardiovascular disease, dia- 
betes mellitus, chronic lung disease, chronic 
kidney disease, obesity, hypertension, or can- 
cer have a much higher mortality rate than 
healthy young adults (2). The underlying causes 
of this difference are unknown, but they may 
be due to an impaired interferon (IFN) response 
and dysregulated inflammatory responses, as 
have been observed with other zoonotic corona- 
virus infections such as severe acute respiratory 
syndrome (SARS) and Middle East respira- 
tory syndrome (MERS) (3). Current research is 
uncovering how the adaptive immune re- 
sponse to SARS-CoV-2 is induced with optimal 
functional capacities to clear SARS-CoV-2 viral 
infection (4-6). 

Understanding the immunological mecha- 
nisms underlying the diverse clinical presen- 
tations of COVID-19 is a crucial step in the 
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design of rational therapeutic strategies. Re- 
cent studies have suggested that COVID-19 
patients are characterized by lymphopenia 
and increased numbers of neutrophils (7-9). 
Most patients with severe COVID-19 exhibit 
enhanced levels of proinflammatory cytokines 
including interleukin-6 (IL-6) and IL-1 as 
well as MCP-1, IP-10, and granulocyte colony- 
stimulating factor (G-CSF) in the plasma (JO). 
It has been proposed that high levels of pro- 
inflammatory cytokines might lead to shock 
as well as respiratory failure or multiple organ 
failure, and several trials to assess inflamma- 
tory mediators are under way (JI). However, 
little is known about the immunological mech- 
anisms underlying COVID-19 severity and the 
extent to which they differ from the immune 
responses to other respiratory viruses. Fur- 
thermore, the question of whether individuals 
in different parts of the world respond differ- 
ently to SARS-CoV-2 remains unknown. In this 
study, we used a systems biological approach 
[mass cytometry and single-cell transcriptomics 
of leukocytes, transcriptomics of bulk periph- 
eral blood mononuclear cells (PBMCs), and 
multiplex analysis of cytokines in plasma] to 
analyze the immune response in 76 COVID-19 
patients and 69 age- and sex-matched controls 
from two geographically distant cohorts. 


Analysis of peripheral blood leukocytes from 
COVID-19 patients by mass cytometry 


COVID-19-infected patient samples and sam- 
ples from age- and sex-matched healthy con- 
trols were obtained from two independent 
cohorts: (i) the Princess Margaret Hospital at 
Hong Kong University and (ii) the Hope Clinic 
at Emory University in Atlanta, Georgia, United 
States. Patient characteristics and the different 
assays performed are shown in Table 1. We used 
mass cytometry to assess immune responses to 
SARS-CoV-2 infection in 52 COVID-19 patients, 
who were confirmed positive for viral RNA by 
polymerase chain reaction (PCR), and 62 age- 
and gender-matched healthy controls distrib- 
uted between the two cohorts. To characterize 
immune cell phenotypes in PBMCs, we used a 
phospho-CyTOF panel that includes 22 cell 
surface markers and 12 intracellular markers 
against an assortment of kinases and phospho- 
specific epitopes of signaling molecules and 
H3K27ac—a marker of histone modification 
that drives epigenetic remodeling (72, 73) (table 
S1). The experimental strategy is described in 
Fig. 1A. The phospho-CyTOF identified 12 main 
subtypes of innate and adaptive immune cells in 
both cohorts, as represented in the t-distributed 
stochastic neighbor embedding (t-SNE) plots 
(Fig. 1B). There was a notable increase in the 
frequency of plasmablast and effector CD8 
T cells in all infected individuals (Fig. 1B) in both 
cohorts, as has been described recently in other 
studies (6, 8, 14). Of note, the kinetics of the 
CD8 effector T cell response were prolonged 
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and continued to increase up to day 40 after 
onset of the symptoms (fig. S1). 

We next used manual gating to identify 25 
immune cell subsets (fig. S2) and determined 
whether there were changes in the frequency 


A 


B 
1 million live 
PBMCs Fix with 2% PFA Freeze 
et ——* for30min (and ship) — 
a% 


a 
40 samples from Atlanta cohort & *| 
100 samples from Hong Kong cohort 


or signaling molecules of innate immune cell 
populations consistent between the two co- 
horts. There were several differences, but no- 
tably the frequency of plasmacytoid dendritic 
cells (pDCs) was significantly reduced in the 


Atlanta cohort 


PBMCs of SARS-CoV-2-infected individuals 
in both cohorts (Fig. 1C). The kinetics of pDC 
response did not show an association with the 
time since symptom onset (fig. S1C). Neither 
did the observed changes correlate with the 


Hong Kong cohort 
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Fig. 1. Mass cytometry analysis of human peripheral blood leukocytes 
from COVID-19 patients. (A) A schematic representation of the experimental 
strategy. PFA, paraformaldehyde. (B) Representation of mass cytometry— 
identified cell clusters visualized by t-SNE in two-dimensional space. The box 
plots on the bottom show frequency of plasmablasts (CD3", CD20", CD56", 
HLA-DR*, CD14, CD16, CDllc”, CD123", CD19°, CD27", and CD38") and effector 
CD8 T cells (CD3*, CD8*, CD38", and HLA-DR") in both cohorts. (C) Frequencies 
of pDCs (CD3°, CD20", CD56", HLA-DR*, CD14", CD16", CD11c", and CD123") 
in healthy and COVID-19-infected individuals in both cohorts. (D and E) Box 
plots showing fold change (FC) of pS6 staining in pDCs (D) and IxBa staining 
in mDCs (E) relative to the medians of healthy controls. The histograms on 
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the right depict representative staining of the same. (F) Distinguishing features 
[false discovery rate (FDR) < 0.01] through linear modeling analysis of the 
mass cytometry data between healthy and infected subjects. In all box plots, the 
boxes show median, upper, and lower quartiles. The whiskers show Sth to 

95th percentiles. Each dot represents an individual sample (healthy: n = 17 and 
45; infected: n = 19 and 54, for Atlanta and Hong Kong cohorts, respectively). 
For the t-SNE analysis, n = 34 and 60 for Atlanta and Hong Kong cohorts, 
respectively. The colors of the dots indicate the severity of clinical disease, as 
shown in the legends. The differences between the groups were measured by 
Mann-Whitney rank sum test (Wilcoxon, paired = FALSE). The P values depicting 
significance are shown within the box plots. 
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rT] 
Table 1. Patient characteristics and number of samples used in different assays. NA, not 


applicable. 


Characteristics 


Hong Kong cohort 


Atlanta cohort 


Number of subjects 


Olink proteomics NA 


*Some patients have blood from multiple time points. 


clinical severity of infection (fig. SI). Addi- 
tionally, there was reduced expression of pS6 
[(phosphorylated ribosomal protein S6), a 
canonical target of mammalian target of rapa- 
mycin (mTOR) activation (75)] in pDCs and 
decreased IxBa—an inhibitor of the signaling 
of the NF-«B transcription factor—in myeloid 
dendritic cells (mDCs) (Fig. 1, D and E). mTOR 
signaling is known to mediate the production 
of interferon-o (IFN-o) in pDCs (J6), which 
suggests that pDCs may be impaired in their 
capacity to produce IFN-o. in COVID-19 patients. 
Finally, we used a linear modeling approach 
to detect features that distinguish healthy from 
infected individuals and those that discrimi- 
nate individuals on the basis of the clinical se- 
verity of COVID-19. This analysis was performed 
with the cohort (Hong Kong or Atlanta) as a 
covariate to identify only features that were 
consistent across both cohorts. The distinguish- 
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66 (51-86) 


55% 


42% 


60% 


17 PBMC samples (15 patients) 
2 


19 plasma samples (16 patients) 


ing features between healthy and infected 
individuals are shown in Fig. 1F. These include 
frequencies of plasmablast and effector T cells 
and the changes in innate immune cells de- 
scribed above in addition to STATI (signal 
transducer and activator of transcription 1) 
and other signaling events in T cells and nat- 
ural killer (NK) cells. Of note, no features were 
significantly different between clinical sever- 
ity groups. 

We further examined the effect of various 
therapeutic interventions on the immune re- 
sponses using samples from the Hong Kong 
cohort, in which some patients were treated 
with IFN-B1, corticosteroids, or antivirals. The 
infected individuals, irrespective of the inter- 
vention, showed an increased plasmablast and 
effector CD8 T cell frequency compared with 
healthy controls (fig. S3). However, there was 
an increased frequency of effector CD8 T cells 
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(fig. S3, bottom panel, right column) and de- 
creased pS6 signal in the pDCs of antiviral- 
treated individuals (fig. S4). 


COVID-19 results in functional impairment 
of blood myeloid cells and pDCs 


Given the earlier findings that mTOR signal- 
ing in pDCs mediates the production of IFN-o 
in response to Toll-like receptor (TLR) stimu- 
lation (16), the reduced expression of pS6 in 
pDCs suggests that such cells may be impaired 
in their capacity to produce IFN-a. To test this, 
we performed ex vivo stimulation of PBMCs 
from healthy or COVID-19-infected individ- 
uals, using a mixture of synthetic TLR7 and 
TLR 8 (TLR7/8) and TLR3 ligands, which are 
known to be expressed by viruses, and we per- 
formed an intracellular staining assay to detect 
cytokine responses. The TLR ligands included 
TLR3 and TLR7/8 ligands, polyIC and R848. 
Consistent with our hypothesis, there was re- 
duced production of IFN-o in response to the 
TLR stimuli in the pDCs of infected individu- 
als compared with those of healthy controls 
(Fig. 2A). The TNF-a response was also signif- 
icantly reduced in the pDCs of infected indi- 
viduals, which demonstrates that the pDCs are 
functionally impaired in COVID-19 infection. 
We also determined the ability of mDCs and 
CD14* monocytes to respond to TLR stimuli. 
Notably, the response in mDCs as well as that 
in monocytes were also significantly lower in 
response to stimulation with a bacterial ligand 
cocktail (composed of TLR2, TLR4, and TLR5 
ligands) or with the viral TLR cocktail (Fig. 2B 
and fig. S5). Furthermore, the reduced IxBa 
levels did not translate into enhanced NF-«B 
subunit p65 phosphorylation as measured 
by p65 (Ser””’) in the same cells (Fig. 2C). These 
results suggest that the innate immune cells in 
the periphery of COVID-19-infected individuals 
are suppressed in their response to TLR stim- 
ulation, irrespective of the clinical severity. 


Enhanced concentrations of cytokines 
and inflammatory mediators in plasma 
from COVID-19 patients 


The impaired cytokine response of myeloid 
cells and pDCs in response to TLR stimulation 
was unexpected and seemingly at odds with 
the literature describing an enhanced inflam- 
matory response in COVID-19-infected indi- 
viduals. Several studies have described higher 
plasma levels of cytokines, including but not 
limited to IL-6, TNF-a, and CXCL10 (10, 17-19). 
Therefore, we evaluated cytokines and chemo- 
kines in plasma samples from the Atlanta co- 
hort using the Olink multiplex inflammation 
panel that measures 92 different cytokines 
and chemokines. Of the 92 analytes measured, 
71 proteins were detected within the dynamic 
range of the assay. Of these 71 proteins, 43 cy- 
tokines, including IL-6, MCP-3, and CXCL10, 
were significantly up-regulated in COVID-19 
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infection (Fig. 3, top row, and fig. S6). These 
results demonstrate that plasma levels of in- 
flammatory molecules were significantly up- 
regulated, despite the impaired cytokine response 
in blood myeloid cells and pDCs, which sug- 
gests a tissue origin of the plasma cytokines. 
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In addition to IL-6 and other cytokines de- 
scribed previously (JO), we identified three 
proteins that were significantly enhanced in 
COVID-19 infection and strongly correlated 
with clinical severity (Fig. 3, bottom row). These 
were TNFSF14 [LIGHT, a ligand of lympho- 


Fig. 2. Flow cytometry analysis of ex 
vivo stimulated human peripheral blood 
leukocytes from COVID-19 patients. 

(A) Box plots showing the fraction of pDCs 
in PBMCs of healthy or infected donors 
(CD3°, CD20", CD56", HLA-DR*, CD14, 
CD16, CD1lc’, and CD123*) producing 
IFN-o, TNF-a, or IFN-o + TNF-o in 
response to stimulation with the viral 
cocktail (polylC + R848). The contour plots 
on the right show IFN-o, TNF-a, or IFN-o + 
TNF-a staining in pDCs. (B) Box plots 
showing the fraction of mDCs in PBMCs of 
healthy or infected donors (CD3°, CD20", 
CD56, HLA-DR*, CD14", CD16, CD123", 
and CD11c’) producing IL-6, TNF-a, or 
IL-6 + TNF-a in response to no stimulation 
(top), the bacterial cocktail (middle; 
Pam3CSK4, LPS, and Flagellin), or the viral 
cocktail (bottom; polylC + R848). The 
flow cytometry plots on the right are 
representative plots gated on mDCs 
showing IL-6, TNF-a, or IL-6 + TNF-o 
response. (C) Fold change of NF-«B p65 
(Ser®”°) staining in PBMCs stimulated 
with bacterial cocktail relative to no 
stimulation in healthy and infected donors 
to show the reduced induction of p65 
phosphorylation in infected individuals. The 
histograms show representative flow 
cytometry plots of p65 staining in mDCs. 
GeoMFl, geometric mean fluorescence 
intensity. In all box plots, the boxes show 
median, upper, and lower quartiles. 

The whiskers show 5th to 95th percentiles. 
Each dot represents an Atlanta cohort 
patient (n = 14 and 17 for healthy and 
nfected, respectively). Colors of the 

dots indicate the severity of clinical dis- 
ease, as shown in the legends. The 
differences between the groups were 
measured by Mann-Whitney rank sum test. 
The P values depicting significance are 
shown within the box plots. 


toxin B receptor that is highly expressed in 
human lung fibroblasts and implicated in lung 
tissue fibrosis and remodeling and inflam- 
mation (20)], EN-RAGE [S100A12, a biomarker 
of pulmonary injury that is implicated in path- 
ogenesis of sepsis-induced ARDS (2/)], and 
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oncostatin M [(OSM), a regulator of IL-6]. 
Of note, the TNFSF14 is distinctively enhanced 
in the plasma of COVID-19-infected individuals 
but not in cases of other related pulmonary 
infections such as influenza (flu) virus and 
respiratory syncytial virus (RSV) (Fig. 3). Given 
the pronounced and unappreciated obser- 
vations of the enhanced plasma concentra- 
tions of TNFSF14, EN-RAGE, and OSM and 
their correlation to disease severity, we used an 
enzyme-linked immunosorbent assay (ELISA) 
to independently validate these results. Con- 
sistent with the multiplex Olink analysis, we 
found a significant increase of these inflam- 
matory mediators in the plasma of severe and 
intensive care unit (ICU) COVID-19 patients. 


Fig. 3. Multiplex analysis of cytokines in the 
plasma of COVID-19 patients. Cytokine levels in 
the plasma of healthy or infected individuals. 

The infected individuals are further classified on 
the basis of the severity of their clinical COVID-19 
disease. The normalized protein expression values 
plotted on the y axes are arbitrary units defined by 
Olink Proteomics to represent Olink data. In all 
box plots, the boxes show median, upper, and lower 
quartiles. The whiskers show 5th to 95th percent- 
iles. Each dot represents an Atlanta cohort 
sample (n = 18 healthy, 4 moderate, 18 severe, 

2 ICU, 2 convalescent, 8 flu, and 11 RSV). 

The colors of the dots indicate the severity of clinical 
disease, as shown in the legends. The differences 
between the groups were measured by Mann-Whitney 
ank sum test (Wilcoxon, paired = FALSE; *P < 
0.05; **P < 0.01; ***P < 0.001; ns, not significant). 


Fig. 4. Early, transient ISG expression in COVID-19 infection. (A) A sche- 
matic representation of DC enrichment strategy used in CITE-seq analysis. 
scRNA-seq, single-cell RNA-seq. (B) UMAP representation of PBMCs from all 
n= 12), colored by manually annotated cell type. (C) Pairwise 
comparison of genes from healthy individuals (n = 5) and COVID-19-infected 
patients (n = 7) was conducted for each cluster. DEGs were analyzed for 
overrepresentation of BTMs. The ringplot shows overrepresented pathways in 
up- and down-regulated genes of each cluster. The heatmap on the right shows 
the average expression levels of 33 ISGs derived from the enriched BTMs 

in different cell clusters of healthy (n = 5) and COVID-19 subjects (n = 7). 
(D) UMAP representation of PBMCs from all analyzed samples showing the 


analyzed samples 
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Furthermore, we found a correlation between 
multiplex analysis by Olink and the ELISA 
results (fig. S7). These results suggest that 
COVID-19 infection induces a distinctive in- 
flammatory program characterized by cyto- 
kines released from tissues (most likely the 
lungs) but suppression of the innate immune 
system in the periphery. These observations 
may also represent previously unexplored ther- 
apeutic strategies for intervention against se- 
vere COVID-19. 


Single-cell transcriptional response 
to COVID-19 infection 


To investigate the molecular and cellular pro- 
cesses that lead to the distinctive inflamma- 


tory program, we used cellular indexing of 
transcriptomes and epitopes by sequencing 
(CITE-seq) and profiled the gene and pro- 
tein expression in PBMC samples of COVID-19- 
infected individuals. Cryopreserved PBMC sam- 
ples from a total of 12 age-matched subjects 
in the Atlanta cohort (five healthy controls and 
seven COVID-19 patients; Table 2) were en- 
riched for DCs, stained using a cocktail of 36 
DNA-labeled antibodies (table S2), and analyzed 
using droplet-based single-cell gene expres- 
sion profiling approaches (Fig. 4A). We per- 
formed the experiment in two batches and 
obtained transcriptomes for more than 63,000 
cells after initial preprocessing. Next, we gen- 
erated a cell-by-gene matrix and conducted 
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expression levels of selected IFNs and |SGs. (E) Kinetics of circulating 
levels (picograms per milliliter) in plasma measured using SIMoA technology 

(n = 18 healthy and 40 COVID-19-infected patients). (F) Correlation between 
circulating IFN-o levels in plasma and the average expression of ISGs measured 

by CITE-seq analysis. (G) Hierarchically clustered heatmap of the expression of 
the CITE-seq ISG signature (C) in the bulk RNA-seq dataset, performed using 

an extended group of subjects (n = 17 healthy and 17 COVID-19-infected 
samples). Colors represent gene-wise z scores. (H) Bar chart representing 
the proportion of variance in CITE-seq ISG signature expression explained by 
the covariates in the x axis through principal variance component analysis 
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dimensionality reduction through uniform 
manifold approximation and projection (UMAP) 
and graph-based clustering. Analysis of cell 
distribution within the UMAP between exper- 
iments revealed no major differences, and we 
analyzed the datasets from the two experiments 
together without batch correction (fig. S8). Next, 
we calculated the per-cell quality control (QC) 
metrics (fig. S9), differentially expressed genes 
(DEGs) in each cluster compared with all other 
cells (fig. S10 and table S4), and the abundance 
of DNA-labeled antibodies in each cell (fig. S11). 
Using this information, we filtered low-quality 
cells and manually annotated the clusters. After 
QC and cluster annotation, we retained a final 
dataset with 57,669 high-quality transcriptomes 
and a median of ~4781 cells per sample and 
1803 individual genes per cell that we used to 
construct the single-cell immune cell landscape 
of COVID-19 (Fig. 4B). 

We observed several clusters that were pri- 
marily identified in COVID-19-infected indi- 
viduals, including a population of plasmablasts, 
platelets, and red blood cells and several pop- 
ulations of granulocytes. Notably, we detected 
clusters of T cells and monocytes that were 
characterized by the expression of interferon- 
stimulated genes (ISGs) such as IFI27, IFITM3, 
or ISG15 (see Cl1-C MONO_IFN and C18-T_IFN 
in fig. S10). These IFN response-enriched clus- 
ters emerged only in samples from COVID-19 
patients (fig. S12). 

To describe the specific transcriptional state 
of single cells from COVID-19-infected indi- 
viduals, we determined the DEGs for cells from 
all COVID-19-infected samples in a given clus- 
ter compared with the cells from all healthy 
individuals in the same cluster. We then an- 
alyzed these DEGs with overrepresentation 
analysis using blood transcriptional modules 
(BTMs) (22) to better understand which im- 
mune pathways are differentially regulated in 
patients with COVID-19 compared with healthy 


individuals (Fig. 4C and fig. $13). The analysis 
indicated a marked induction of antiviral BTMs, 
especially in cell types belonging to the mye- 
loid and dendritic cell lineage. Detailed anal- 
ysis of the expression pattern of the distinct 
union of genes driving the enrichment of these 
antiviral pathways in monocytes and dendritic 
cells revealed that many ISGs were up-regulated 
in these cell types (Fig. 4C, heatmap). Given 
our observations of muted IFN-o production 
in pDCs (Fig. 2A), we investigated the expres- 
sion of genes encoding various type I and type 
II IFNs across cell types (Fig. 4D and fig. S14). 
Notably, with the exception of modest levels 
of IFN-y expression in T and NK cells, we could 
not detect any expression of IFN-co. and -B genes, 
which is consistent with the functional data 
demonstrating impaired type I IFN produc- 
tion by pDCs and myeloid cells (Fig. 2). How- 
ever, there was an enhanced expression of ISGs 
in patients with COVID-19 (Fig. 4D) in spite of 
an impaired capacity of the innate cells in the 
blood compartment to produce these cytokines. 

Despite the lack of type I IFN gene expres- 
sion, the presence of an ISG signature in the 
PBMCs of COVID-19-infected individuals raised 
the possibility that low quantities of type I IFNs 
produced in the lung by SARS-CoV-2 infection 
(77) might circulate in the plasma and induce 
the expression of ISGs in PBMCs. We thus mea- 
sured the concentration of IFN-o in plasma 
using a highly sensitive ELISA enabled by single 
molecule array (SIMoA) technology. We ob- 
served a marked increase in the concentra- 
tion of IFN-o, which peaked around day 8 after 
onset of symptoms and regressed to baseline 
levels by day 20 (Fig. 4E). Notably, we observed a 
strong correlation between the average ex- 
pression levels of the ISG signature in PBMCs 
identified by CITE-seq analysis and the IFN-o. 
concentration in plasma (Fig. 4F). Addition- 
ally, we noticed a strong temporal dependence 
of the IFN-a response. 


Table 2. Detailed characteristics of patient samples used in the CITE-seq analysis. Dashes 
indicate that the information is not applicable. dec., deceased; F, female; M, male; B, Black; W, white. 


ID Infection 
covl COVID-19 


Severe, dec. 


hd5 Healthy 
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To investigate this further and to indepen- 
dently validate the observations in the CITE-seq 
analysis, we performed bulk RNA sequenc- 
ing (RNA-seq) analysis of PBMCs in an ex- 
tended group of subjects (17 COVID-19 patients 
and 17 healthy controls) from the same cohort. 
We first evaluated whether the ISG signature 
containing 33 genes identified in the CITE-seq 
data was also observed in the bulk RNA-seq 
dataset. We observed a strong induction of the 
ISGs in COVID-19 subjects compared with 
healthy donors in this dataset as well (Fig. 4G). 
Of note, we did not detect expression of genes 
encoding IFN-o or IFN-f, consistent with the 
CITE-seq and flow cytometry experiments (Fig. 
4D and Fig. 2, respectively). We also performed 
an unbiased analysis of an extended set of genes 
in the IFN transcriptional network (23) and 
found that these were induced in COVID-19 
subjects relative to healthy controls, as observed 
for the limited ISG signature (fig. SI5A). Sim- 
ilar to the observation with CITE-seq data (Fig. 
4F), there was a strong correlation between cir- 
culating IFN-o and the ISG response measured 
by the bulk transcriptomics (fig. S15B). Addi- 
tionally, we analyzed the individual impact of 
major covariates—time, disease severity, sex, and 
age—on the observed ISG signature. Although 
time emerged as the primary driver of ISG sig- 
nature, COVID-19 clinical severity also had an 
effect (Fig. 4H and fig. $15C). Taken together, 
these data demonstrate that, early during SARS- 
CoV-2 infection, there are low levels of circulat- 
ing IFN-o that induce ISGs in the periphery 
while the innate immune cells in the periph- 
ery are impaired in their capacity to produce 
inflammatory cytokines. 

In addition to an enhanced ISG signature, 
the CITE-seq analysis revealed a significant 
decrease in the expression of genes involved in 
the antigen-presentation pathways in myeloid 
cells (Fig. 4C and fig. $13). Consistent with this, 
we observed a reduction in the expression of 
the proteins CD86 and human leukocyte anti- 
gen class DR (HLA-DR) on monocytes and 
mDCs of COVID-19 patients, which was most 
pronounced in subjects with severe COVID-19 
infection (Fig. 5A and fig. S16A). HLA-DR is an 
important mediator of antigen presentation 
and is crucial for the induction of T helper cell 
responses. Using the phospho-CyTOF data from 
both the Atlanta and Hong Kong cohorts, we 
confirmed the reduced expression of HLA-DR 
on monocytes and mDCs in patients with severe 
COVID-19 disease (Fig. 5B). By contrast, S100A12, 
the gene encoding EN-RAGE, was substantially 
increased in the PBMCs of COVID-19 patients, 
whereas the expression of genes encoding other 
proinflammatory cytokines was either absent or 
unchanged compared with healthy controls 
(Fig. 5C and fig. S16B). Notably, the S100A12 
expression was highly restricted to monocyte 
clusters (Fig. 5D) and showed a significant cor- 
relation with EN-RAGE protein levels in plasma 
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A Flow: HLA-DR levels (CITE-seq samples) 
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Fig. 5. Attenuated inflammatory response in peripheral innate immune 
cells from COVID-19 patients. (A) Flow cytometry analysis of PBMCs analyzed 
in parallel to the CITE-seq experiment. The logjg median fluorescence intensity 
(MFI) of HLA-DR expression is shown. (B) Median intensity of HLA-DR expression 
in the phospho-CyTOF experiment from Fig. 1. Squares represent individual 
samples [Hong Kong (HK): healthy = 30, moderate = 15, and severe = 10; and 
Atlanta: healthy = 17, moderate = 4, and severe = 13]. The boxes indicate median, 
upper, and lower quartiles. The whisker length equals 1.5 times the interquartile 
range. (C) Relative (Rel.) expression of genes encoding different cytokines in the 


bulk RNA-seq dataset. The boxes show median, upper, and lower quartiles, 

and the whiskers show 5th to 95th percentiles. (D) UMAP representation of 
S100A12 expression in PBMCs from all samples analyzed by CITE-seq. 

(E and F) Correlation (Cor) analysis of SIOOA12 expression in cells from myeloid 
and dendritic cell clusters (C MONO_1, NC MONO, CDC2, PDC, C MONO_IFN, 

C MONO_2, and C MONO_3) with EN-RAGE levels in plasma (E) or HLA-DPA1 
expression in the same clusters (F) (n = 5 healthy and 7 COVID-19 subjects). The 
statistical significance between the groups in (B) and (C) was determined by 
two-sided Mann-Whitney rank-sum test; *P < 0.05; **P < 0.01; ***P < 0.001. 


measured by Olink (Fig. 5E). Finally, we exam- 
ined whether there is an association between 
HLA-DR and S100A12 expression in our data- 
set, and we found a strong inverse correlation 
between S100A12 gene expression and the 
genes encoding the antigen presentation ma- 
chinery (HLA-DPA1, HLA-DPB1, HLA-DR, and 
CD74) (Fig. 5F and fig. S17). Notably, the re- 
ceptor for S100A12, AGER (RAGE), was ex- 
pressed sparsely in PBMCs (fig. S18), which 
suggests that the target of EN-RAGE action 
was likely to be elsewhere—perhaps the lung, 
where RAGE is known to be expressed in type 
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I alveolar epithelial cells and mediate inflam- 
mation (24). 

Taken together, CITE-seq analysis of PBMCs 
in COVID-19 patients revealed the following 
mechanistic insights: (i) a lack of expression of 
genes encoding type I IFN and proinflamma- 
tory cytokines in PBMCs, which was consistent 
with the mass cytometry (Fig. 1C) and func- 
tional data (Fig. 2); (ii) an early but transient 
wave of ISG expression, which was entirely 
consistent with analysis of RNA-seq from bulk 
PBMCs (Fig. 4G and fig. S15A) and strongly 
correlated with an early burst of plasma IFN-a 
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(Fig. 4F), likely of lung origin (17); and (iii) the 
impaired expression of HLA-DR and CD86 but 
enhanced expression of S100A12 in myeloid 
cells, which was consistent with the mass cy- 
tometry (Fig. 5B), Olink (Fig. 3), and ELISA 
(fig. S7) data, and is a phenotype reminiscent 
of myeloid-derived suppressor cells described 
previously (25). 


Severe COVID-19 infection is associated with 
the systemic release of bacterial products 


The increased levels of proinflammatory me- 
diators in the plasma—including IL-6, TNF, 
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TNEFSF14, EN-RAGE, and OSM (Fig. 3)—coupled 
with suppressed innate immune responses 
in blood monocytes and DCs (Fig. 2 and fig. 
S5) suggested a sepsis-like clinical condition 
(26, 27). In this context, it has been previously 
suggested that proinflammatory cytokines 
and bacterial products in the plasma may play 
pathogenic roles in sepsis, and the combina- 
tion of these factors could be important in 
determining patient survival (28, 29). There- 
fore, to determine whether a similar mech- 
anism could be at play in patients with severe 
COVID-19, we measured bacterial DNA and 
lipopolysaccharide (LPS) in the plasma. Nota- 
bly, the plasma of severe and ICU patients had 
significantly higher levels of bacterial DNA, 
as measured by PCR quantitation of bacterial 
16S ribosomal RNA (rRNA) gene product, and 
of LPS, as measured by a TLR4-based reporter 
assay (Fig. 6, A and B). Furthermore, there was 
a significant correlation between bacterial DNA 
or LPS and the plasma levels of the inflamma- 
tory mediators IL-6, TNF, MCP-3, EN-RAGE, 
TNFSF14, and OSM (Fig. 6C and fig. S19). These 
results suggest that the enhanced cytokine re- 
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lease may in part be caused by increased bacte- 
rial products in the lung or in other tissues. 


Discussion 


We used a systems biology approach to deter- 
mine host immune responses to COVID-19. 
Mass cytometry analysis of peripheral blood 
leukocytes from two independent cohorts re- 
vealed several common features of immune 
responses induced upon SARS-CoV-2 infec- 
tion. There was a notable and protracted in- 
crease in the frequencies of plasmablasts and 
effector CD8 T cells in the peripheral blood, 
consistent with recent studies (6, 8, 14). Nota- 
bly, the effector T cells continued to increase 
up to day 40 after symptom onset. Studies 
have shown that SARS-CoV-2 infection indu- 
ces exhaustion and apoptosis in T cells (30, 37). 
Whether the continuing effector CD8 T cell 
response reflects continuous exposure to anti- 
gen and whether the cells are exhausted will 
require further investigation. 

In contrast to robust activation of B and T 
cells, we observed a significant decrease in the 
frequency of pDCs. Furthermore, mTOR sig- 
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naling in pDCs was reduced significantly in 
COVID-19-infected individuals, as measured 
by decreased pS6 signaling by mass cytome- 
try. These results suggest that pDCs, the pri- 
mary producers of type I IFNs, are impaired in 
COVID-19 infection, which is consistent with 
studies in SARS-CoV infection (32). To deter- 
mine whether the reduced mTOR signaling in 
pDCs resulted in impairment of type I IFN pro- 
duction, we stimulated cells in vitro with TLR 
ligands. Our results demonstrate that pDCs 
from COVID-19-infected patients are func- 
tionally impaired in their capacity to produce 
IFN-o in response to TLR stimulation. Taken 
together, these data suggest that COVID-19 
causes an impaired type I IFN response in 
the periphery. Administration of type I IFN 
has been proposed as a strategy for COVID-19 
intervention (33); however, it must be noted 
that type I IFN signaling has been shown to 
elevate angiotensin-converting enzyme 2 (ACE2) 
expression (34) in lung cells, which can po- 
tentially lead to enhanced infection. 

In addition to the impaired IFN-o produc- 


tion by pDCs, there was a marked diminution 
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of the proinflammatory cytokines IL-6, TNF-o, 
and IL-18 produced by monocytes and mDCs 
upon TLR stimulation (Fig. 2B). This was con- 
sistent with the lack of or diminished expres- 
sion of the genes encoding IL-6 and TNF in 
the CITE-seq analysis (Fig. 5C). These results 
suggest an impaired innate response in blood 
leukocytes of patients with COVID-19. This 
concept was further supported by the CyTOF 
and flow cytometry data that showed decreased 
HLA-DR and CD86 expression, respectively, 
in myeloid cells (Fig. 5, D and E, and fig. S16). 
To obtain deeper insight into the mechanisms 
of host immunity to SARS-CoV-2, we performed 
CITE-seq single-cell RNA-seq and bulk RNA-seq 
analysis in COVID-19 patients at various stages 
of clinical severity. Our data demonstrate that 
SARS-CoV-2 infection results in an early wave 
of IFN-a in the circulation that induces an ISG 
signature. Although the ISG signature shows 
a strong temporal dependence in our data- 
sets, we also find that the ISG signature is 
strongly induced in patients with moderate 
COVID-19 infection (Fig. 4G). Consistent with 
this, Hadjadj et al. (5) have reported an en- 
hanced expression of ISGs in patients with 
moderate disease compared with those with 
severe or critical disease. Taken together, these 
data suggest that SARS-CoV-2 infection induces 
an early, transient type I IFN production in the 
lungs that induces ISGs in the peripheral blood, 
primarily in patients with mild or moderate 
disease. Additionally, we observed reduced 
expression of genes encoding proinflamma- 
tory cytokines, as well as HLA-DR expression 
in myeloid cells, which was consistent with the 
CyTOF and flow cytometry data showing re- 
duced HLA-DR and CD86 expression, respec- 
tively, in myeloid cells. 

Our multiplex analysis of plasma cytokines 
revealed enhanced levels of several proin- 
flammatory cytokines, as has been observed 
previously (35), and revealed a strong associ- 
ation of the inflammatory mediators EN-RAGE, 
TNFSF14, and OSM with the clinical severity 
of the disease. Notably, the expression of genes 
encoding both TNFSF14 and OSM were down- 
regulated in the PBMCs from COVID-19 pa- 
tients with severe disease in the analysis of 
CITE-seq data (Fig. 5C), which suggests a tis- 
sue origin for these cytokines. The gene en- 
coding EN-RAGE, however, was expressed at 
high levels in blood myeloid cells in patients 
with severe COVID-19 (Fig. 5, C to F) (although 
it is also possible that EN-RAGE is expressed 
in the lungs too). Of note, these three cytokines 
have been associated with lung inflamma- 
tory diseases. In particular, EN-RAGE has been 
shown to be expressed by CD14* HLA-DR"? 
cells, the myeloid-derived suppressor cells, and 
it is a marker of inflammation in severe sepsis 
(21, 25, 36). Additionally, its receptor, RAGE, 
is highly expressed in type I alveolar cells in 
the lung (24). Notably, we observed that the 
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classical monocytes and myeloid cells from 
severe COVID-19 patients in the single-cell 
RNA-seq data expressed high levels of S100A12, 
the gene encoding EN-RAGE, but not the typ- 
ical inflammatory molecules IL-6 and TNF-a. 
These data suggest that the proinflammatory 
cytokines observed in plasma likely originate 
from the cells in lung tissue rather than from 
peripheral blood cells. Taken together with 
the mass cytometry data, the plasma cytokine 
data may be utilized to construct an immu- 
nological profile that discriminates between 
severe versus moderate COVID-19 disease 
(fig. S20). 

These results suggest that SARS-CoV-2 infec- 
tion results in a spatial dichotomy in the innate 
immune response, characterized by suppres- 
sion of peripheral innate immunity in the face 
of proinflammatory responses that have been 
reported in the lungs (37). Furthermore, there 
is a temporal shift in the cytokine response 
from an early but transient type I IFN response 
to a proinflammatory response during the later 
and more severe stages, which is similar to that 
observed with other diseases such as influenza 
(38). Notably, there were enhanced levels of 
bacterial DNA and LPS in the plasma, which 
were positively correlated with the plasma 
levels of EN-RAGE, TNFSF14, OSM, and IL-6, 
which suggests a role for bacterial products— 
perhaps of lung origin—in augmenting the pro- 
duction of inflammatory cytokines in severe 
COVID-19. The biological consequence of the 
impaired innate response in peripheral blood 
is unknown but may reflect a homeostatic 
mechanism to prevent rampant systemic hyper- 
activation, in the face of tissue inflammation. 
Finally, these results highlight molecules such 
as EN-RAGE or TNFSF14, and their receptors, 
which could represent attractive therapeutic 
targets against COVID-19. 
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Structure of a human 48S translational 


initiation complex 


Jailson Brito Querido’*, Masaaki Sokabe*, Sebastian Kraatz"*, Yuliya Gordiyenko’, J. Mark Skehel?, 


Christopher S. Fraser*+, V. Ramakrishnan‘+ 


A key step in translational initiation is the recruitment of the 43S preinitiation complex by the 
cap-binding complex [eukaryotic initiation factor 4F (elF4F)] at the 5' end of messenger RNA (mRNA) 
to form the 48S initiation complex (i.e., the 48S). The 48S then scans along the mRNA to locate a 
start codon. To understand the mechanisms involved, we used cryo-electron microscopy to determine 
the structure of a reconstituted human 48S. The structure reveals insights into early events of 
translation initiation complex assembly, as well as how elF4F interacts with subunits of elF3 near the 
mRNA exit channel in the 43S. The location of elF4F is consistent with a slotting model of mRNA 
recruitment and suggests that downstream mRNA is unwound at least in part by being “pulled” 


through the 40S subunit during scanning. 


he recruitment of the 43S preinitiation 

complex (43S PIC) to the 5’ end of mRNA is 

a critical step during translation initiation. 

Eukaryotic initiation factors eIF1, eIF1A, 

elF3, and eIF5 and the ternary complex 
(TC) of eIF2-guanosine 5’-triphosphate (GTP)- 
methionine initiator transfer RNA (tRNA) 
bind to the 40S ribosomal subunit to form the 
43S PIC. Once assembled, the 43S PIC is re- 
cruited to the cap-binding complex eIF4F at 
the 5’ end of mRNA to form a 48S initiation 
complex (i.e., the 48S). eIF4F consists of a scaf- 
fold protein eIF4G, a 7-methylguanosine (m’G) 
cap-binding protein eIF4E, and a DEAD-box 
helicase eIF4A. This complex enhances 43S 
PIC binding and scanning along the mRNA 
until the start codon is recognized (1-3). In 
mammals, the recruitment of 43S PIC to mRNA 
requires interactions between eIF3 and the 
eIF4G subunit of eIF4F (4-6). 

Mammalian eIF3 is a 13-subunit complex 
(eIF3a to -m) that coordinates several aspects 
of translation. It stabilizes the binding of the 
TC on the 40S and interacts with eIF1, eIF1A, 
and eIF5, which are involved in fidelity of start- 
site recognition (7, 8). It also prevents pre- 
mature association of the ribosomal subunits 
(9) and regulates recruitment of the 43S PIC 
to mRNA by interacting directly with eIF4F 
(4-6). Although structures of mammalian 
eIF3 have been determined at low resolution 
(~6 A) (10), it is not clear how it coordinates 
these vital functions during translation. Ad- 
ditionally, a fundamental question remains: 
How does eIF4F and its adenosine triphos- 
phatase (ATPase) cycle promote mRNA re- 
cruitment and scanning along mRNA? In 
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particular, how is the activity of eIF4F coor- 
dinated with eIF3 in the 48S? 

In this work, we use single-particle cryo- 
electron microscopy (cryo-EM) to determine 
the structure of a reconstituted human 48S. 
Our work provides a detailed structure of eIFs 
and structural insights into how elF4F inter- 
acts with eIF3 as part of the 48S. 


In vitro reconstitution of the human 48S 
and its overall structure 


To characterize our purified reconstituted sys- 
tem, we established that mRNA recruitment 
and scanning follow an eIF4F-dependent 
pathway. To this end, start-site selection on an 
m’G capped mRNA was monitored by using 
the RelE toxin to cleave mRNA in the A site 
of the 40S subunit when a codon-anticodon 
interaction forms between tRNA; and the 
AUG codon in the P site (77) (Fig. 1, Aand B, 
and fig. S1A). The first AUG codon is pref- 
erentially selected in our system, consistent 
with the scanning model of initiation. We 
observe efficient start-site selection in the 
presence of ATP and ATP-y-S. By contrast, 
adenylyl-imidodiphosphate (AMP-PNP) ap- 
preciably reduces start-site selection, which 
is consistent with our previous work (12). The 
kinetics of start-site selection is strongly en- 
hanced by eIF4F, indicating that mRNA re- 
cruitment and scanning preferentially follow 
an eIF4F-dependent pathway. To gain insights 
into the mechanism of mRNA recruitment 
and scanning, we used single-particle cryo-EM 
to determine the structure of a 48S complex 
assembled (with ATP-y-S) on a very similar 
mRNA but without a start site (Fig. 1A and 
figs. S2 and S3). An mRNA without a start 
site was used to capture the 48S at a pathway 
intermediate after mRNA recruitment but 
before start-site selection. Because ATP-y-S 
behaves similarly to ATP in the reconstituted 
system, we reasoned that this intermediate 
most likely resembles a scanning intermediate. 
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Although cross-linking with BS3 (materials 
and methods) did not change the overall struc- 
ture we obtained (fig. S4), it increased the num- 
ber of particles containing eIFs and thereby 
improved resolution. For the analysis of sev- 
eral regions that appeared highly dynamic, 
we used multibody and focused refinement 
(fig. S5). The overall resolution of 3.1 A (figs. 
S2 and S3) allowed us to identify and place 
in the maps the previously known structures 
of the 40S, eIF1, eIF1A, eIF2 (a, B, and y), 
tRNA;M“ and the octameric structural core of 
eIF3 as well as its peripheral subunits (b, d, g, 
i,j) (Fig. 1, C to E; fig. S6; and table S1). These 
placements were used to segment the maps 
for detailed model building. Further masked 
classification on additional unaccounted den- 
sity yielded a map (fig. S2 and S3) that, despite 
having a slightly reduced overall resolution 
of 3.4 A, improved this additional density. 
On the basis of our structural and biochem- 
ical analysis (fig. S1, B and C) as well as prior 
data (4-6, 13), we identified this density to 
be eIF4F (eIF4G, eIF4A, and possibly eIF4E) 
(Fig. 1, C to E, and fig. S6). 

The structure reveals a conformation of 
the TC that does not involve codon-anticodon 
base pairing (fig. S7), which is likely to reflect 
the 48S complex in the process of scanning. 
The tRNA,“ is in a previously unseen orien- 
tation, which is intermediate between the pre- 
viously identified Pyy (in which the tRNA is 
stably base-paired with the start codon in the 
P site) and Poyr (in which the tRNA is not 
fully inserted into the P site) states (14) (Fig. 
1F). Furthermore, the 40S in this complex has 
an unusual conformation (Fig. 1, G and H). A 
downward movement of the 40S head when 
transitioning from the pre- to postscanning 
state has been described (14, 15), but we ob- 
serve an additional swivel movement of the 
head (Fig. 1H). In bacterial initiation, a head 
swivel changes the position of the tRNA“ 
from the P site to between the P and E sites 
(16). Although the movement is similar, in 
this case the tRNA;““ remains in the P site. 
The much looser interaction of the tRNA M* 
with mRNA (fig. $7), along with the absence 
of a start codon in the mRNA, suggests that 
the structure represents the conformation of 
the 48S during or just before scanning. 


A near-atomic resolution structure 
of the human elF3 in the context of 48S 


The details of the organization of the 48S and 
its intramolecular interactions were poorly 
understood in the absence of a high-resolution 
structure of the complex. The 43S part of 
our 48S structure has a resolution of ~3 A and 
includes all 13 subunits of eIF3 (Fig. 1 and 
fig. S8, A to C), most of which could be modeled 
in atomic detail. As a result, we can under- 
stand how evolutionarily conserved residues 
of eIF3 are important for interactions with 
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Fig. 1. Structure and functional characterization of human 48S complex. 
(A) RelE cleavage shows that the efficiency of start-site selection depends on elF4F 
(lanes 1 and 4) as well as on ATP hydrolysis (lines 1 and 3). The slowly hydrolysable 
ATP analog ATPyS (lane 2) has almost the same efficiency of start-site selection 
when compared with ATP. For the gel, RelE cleavage assay was performed 20 min 
after the formation of the 48S complex. The degradation band is present in the 
mRNA prep even in the absence of RelE cleavage (fig. SIA). FL, fluorescent labeled; 
nt, nucleotide. (B) Kinetic analysis in the presence of elF4F with ATP or ATPyS, and 
with ATP absence of elF4F, shows an elF4F and ATP-dependent mechanism of 
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scanning under our experimental conditions, and that ATPyS is almost as good as 
ATP. Error bars indicate SEM. (C to E) Overall structure of 48S shown in different 
orientations. elFs are shown in cartoon. tRNAM* and 40S are represented as magenta 
and gray spheres. Sugar phosphate backbone of the mRNA is shown as a blue 
surface. CTD, C-terminal domain. (F) Superposition of the tRNAM* with the structure of 
tRNAM* in the context of 48S in open (Pour) and closed conformation (Pi) (14) shows an 
intermediate conformation of the anticodon stem loop. (G@ and H) Superposition of 18S 
rRNA with the structure of 48S in the open and closed conformations (14) to highlight the 
changes in their conformation and the swivel movement of the head during scanning. 


18S ribosomal RNA (rRNA) and ribosomal 
proteins (1-proteins) and rationalize previous 
biochemical data (17, 18). 

The subunits eIF3a and eIF3c are univer- 
sally conserved across eukaryotes. They inter- 
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act with the 40S as well as mRNA through 
their RNA binding motifs (Fig. 2, A to D, 
and table S2) (18). The RNA binding motif of 
elF3a (conserved in metazoans) interacts with 
mRNA near the exit site (Fig. 2A), whereas 
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the RNA binding motif of eIF3c (conserved 
in eukaryotes) interacts with rRNA on the back 
of 40S (Fig. 2D and table S2). Additionally, the 
highly conserved residue Lys in eIF3a inter- 
acts with rRNA expansion segment 7 (ES7°) 
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Fig. 2. Interactions of elF3 with mRNA and 40S. (A) Close-up of the mRNA 
exit site highlighting the interaction of elF3a (pink) with mRNA (blue sugar- 
phosphate backbone). K, Lys. (B) elF3a interacts with ribosomal protein eS1 
(yellow) near the exit site. Red dashed lines represent protein-protein or 
protein-RNA interactions. elF3a R14-a and K23 interact with eS1 E78 and 
$192. Furthermore, elF3a K63 interacts with rRNA ES7° C1116. C, Cys; E, Glu; 
K, Lys; R, Arg; S, Ser. (€) Close-up of R14 in elF3a to highlight in two 


alternative rotamer conformations (R14-a and R14-b) and the interaction of 
R14-b with D77 of r-protein eS1. D, Asp; Q, Glin. (D) Close-up of the interaction 
of elF3c with 18S rRNA on the back of the 40S. G, Gly. (E) elF3d-NTT (orchid) 
fitted into the cryo-EM map to highlight the close interactions with PCI 
domains of elF3c (cyan surface) and elF3e (green surface). V, Val. (F) Close- 
up of the PCI domain of elF3c to highlight some described interactions 

with elF3d-NTT. A, Ala; W, Trp. 


near the exit site (Fig. 2B and table S2). The 
structure also reveals the basis of the interac- 
tion of eIF3a and eIF3c with r-proteins (Fig. 2B 
and table $2). Although the sequence register 
of our structure differs from that of a previous 
low-resolution structure of rabbit eIF3 (JO), it 
agrees with the crystal structure of yeast eIF3a 
and elF3c (fig. S8, D to H) (79). Furthermore, 
the amino acid sequence of the region of eIF3a 
that interacts with r-protein eS1 (table S2) is 
highly conserved in mammals (fig. SSF). Thus, 
it is likely that there is also structural conser- 
vation between human and rabbit eIF3a. Con- 
sistently, some interactions observed between 
human elF3a and r-protein eS1 (table S2) 
have also been partially described in a recent 
structure of rabbit eIF3 (20). 

Additionally, eIF3d interacts with both 40S 
and the octameric structural core of eIF3 
(Fig. 2, E and F; table S2; and fig S8). The N- 
terminal tail of eIF3d (eIF3d-NTT), hitherto 
unseen, interacts with the PCI domain of eIF3e 
(Fig. 2E), consistent with previous biochem- 
ical data (21). 

The structure shows that eIF3d interacts 
with the eIF3 octameric structural core, as 
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well as potentially with eIF4F. The subunit 
binds to a region of eIF3e that is also in- 
volved in binding to eIF4F, which agrees 
with predicted interaction between eIF3d 
and elF4G (4). eIF3d also interacts with eIF3c 
and probably elF3a (Fig. 2E and fig. S8B). The 
eIF3d-NTT loop (residues 33 to 59), which 
binds to the PCI domain of eIF3c, contains 
highly conserved residues such as Trp’, which 
interacts with Gin®”’ in eIF3c (Fig. 2F and 
table S2). 


elF1 binds to a mammalian-specific insertion 
in the elF3c N-terminal domain 


eIF3 coordinates start-site selection by inter- 
acting with the fidelity factors eIF1 and eIF5 
(7, 8) and prevents premature association be- 
tween ribosomal subunits (9). The N-terminal 
domain of eIF3c (eIF3c-NTD) extends toward 
the decoding center of the ribosome, where it 
interacts with eIF1 (fig. S9). 

Our structure and accompanying bio- 
chemistry (fig. S9) unexpectedly reveal that 
a conserved mammalian-specific insertion 
in elF3c (fig. S9) is involved in the interac- 
tion with elIF1. In yeast, the interaction be- 
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tween eIF3c-NTD and elIF1 occurs through 
the very N-terminal tail of eIF3c (residues 1 to 
63) (7), whereas our structures and biochem- 
ical data reveal that this interaction occurs 
through the C-terminal end of eIF3c-NTD 
(residues 166 to 287) (fig. S9). 

The resolution and completeness of the 
structure allowed us to build and assign to 
eIF3c-NTD a cluster of four helices located 
in a pocket formed by rRNA helices h11, 
h27, and h44 and ribosomal protein uS15 
(fig. S9). The main interaction with 40S oc- 
curs through the conserved and charged res- 
idues located in helix 4 (fig. S9 and table S2). 
Because this domain would clash sterically 
with parts of rRNA from the large subunit 
involved in the formation of intersubunit 
bridges B4 and eB11 (fig. $10), it should con- 
tribute to the anti-association activity of 
eIF3 (9). The same density, but at low reso- 
lution, has also been observed in yeast, in 
which it was also tentatively assigned to 
eIF3c-NTD (7, 14). Thus, the structural basis 
for the anti-association activity of eIF3c-NTD 
appears to be evolutionarily conserved among 
eukaryotes. 
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Fig. 3. Interactions of peripheral subunits of elF3. (A) elF3g-RNA binding 
motif viewed from solvent side to highlight its interaction with rRNA in helix 16 
and ribosomal proteins uS3 and eS10. (B) Binding interface between elF3g 
and the ribosomal protein uS3 at the mRNA entry channel. H, His; |, lle; L, Leu; 
Y, Tyr. (€) elF3j binds to ribosomal proteins eS30 and uS12 near the A site. The 


elF3g binds at the mRNA entry site 

eIF3g contains an RNA-recognition motif 
(eIF3g-RRM) that is thought to enhance scan- 
ning efficiency (22). In a previous low-resolution 
study, a density corresponding to RRM mo- 
tifs was observed at the mRNA entry site and 
assigned to eIF4B (23). However, this density 
was still present in a 48S complex we assem- 
bled without eIF4B (fig. S11). We have now 
assigned this density to eIF3g-RRM, although 
we cannot rule out that eIF4B can also bind 
to this region during the initiation pathway. 
In our structure, eIF3g-RRM interacts with 
rRNA hi6 and ribosomal proteins uS3 and 
eS10 (Fig. 3, A and B, and table S2). Consider- 
ing that it binds at the mRNA entry channel 
and its possible interaction with backbone of 
the mRNA, eIF3g-RRM could facilitate the re- 
cruitment of the 43S PIC to mRNA, thereby 
enhancing translational efficiency (22). 

The elIF3bgi subcomplex was previously 
suggested to bind the 40S subunit through 
an interaction between the eIF3 b-propeller 
domain and uS4 ribosomal protein (19). A 
better local resolution in our structure (figs. 
S5 and S12), together with the identification 
of the eIF3g-RRM, improves our understand- 


Brito Querido et al., Science 369, 1220-1227 (2020) 


uS3 


rRNA h16 


ABCE1 FeSD 


esio B 


elF3g RRM 


ing about the residues in the eIF3bgi sub- 
complex that interact with the 40S subunit 
(table S2). 


A dual role for elF3j in start-site selection 
and recycling 


eIF3j binds near the decoding center of the 
ribosome as predicted (24, 25), where it could 
coordinate start-site selection. The eIF3j-NTD 
and helix 5 interact with 40S body through 
ribosomal proteins eS30, uS12, and 18S rRNA. 
Additionally, eIF3j is close enough to inter- 
act with eIFIA (Fig. 3C and fig. $13). The C- 
terminal domain of eIF3j bridges the head 
with the body of 40S by interacting with the 
rRNA h34 in the mRNA entry latch (a con- 
striction in the mRNA entry channel). This 
interaction may have a regulatory role by pre- 
venting the movement of the head of 40S and 
limiting the closed conformation of the ribo- 
some (12). Furthermore, it might be associated 
with the known role of eIF3j in preventing 
leaky scanning (26). 

Human elF3j interacts with 40S in the same 
location where the mammalian translational 
auxiliary factor DHX29 has been observed 
to bind (fig. $14) (20). Thus, the recruitment 
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elF3j-NTT NBD1 


C-terminal tail of elF3j (elF3j-CTT) interacts with ribosomal rRNA in helix 34. elF3)- 
NTT is positioned next to the GTPase binding region of the 40S. (D) Superposition 
of elF3j with the structure of ABCE1 (PDB: 5LL6) bound to the 40S subunit 
(post splitting) (29). (E and F) elF3j-NTT extends toward the GTPase binding 
region of the 40S, where the nucleotide binding domain 1 (NBD1) of ABCE1 binds. 


of DHX29 during the translation of select 
mRNAs may trigger the release of eIF3j and 
overcome its possible regulatory role. This in- 
terpretation agrees with the unwinding inde- 
pendent role of DHX29 during scanning (27). 

The structure also suggests how elF3j bridges 
recycling with a new round of translation. 
The eIF3j-NTT extends toward the conserved 
GTPase binding region of the ribosome (Fig. 3, 
D to F), where ABCE1 binds. ABCE1 is an 
ATPase that is involved in recycling of eukary- 
otic ribosomes after termination. Although 
ATP hydrolysis is not required for ribosome 
splitting, it is required for subsequent re- 
lease of ABCE1 (28). The structure of ABCE1 is 
different in the 80S ribosome (presplitting), 
compared with the one bound just to the 40S 
subunit (post splitting). Superimposing our 
structure with structures of ABCE1 in the post- 
splitting complex (29) places the eIF3j-NTT 
in a position where it could interact with the 
nucleotide-binding domain of ABCE1 (Fig. 3, 
E and F). This model would be consistent with 
previous studies showing that the eIF3j-NTD 
facilitates ribosome recycling by ABCE1 (30). 
The structure suggests that the rotation of 
the iron-sulfur cluster domain of ABCE1 that 
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occurs after splitting (20, 29) would prevent 
a steric clash with elF3j (fig. S14). Thus, eI F3j 
is likely to be recruited after subunit split- 
ting to prepare the resulting 40S subunit for 
a new round of initiation. This interpreta- 
tion agrees with previous data showing co- 
purification of ABCE1 with components of the 
43S PIC, including eIF3j (29). After the release 
of ABCE1, eIF3 would prevent premature as- 
sociation of 60S. 
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and -I near the mRNA exit channel 

of the 40S subunit 

Adjacent to elF3e, -k, and -l, we see a region 
that has low local resolution due to flexibility 
(fig. SI5A) that we attribute to eIF4F. Biochem- 
ical and genetics studies have indicated that 
the recruitment of 43S to an mRNA bound to 
the cap-binding complex and the subsequent 
scanning process are greatly enhanced by in- 
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teractions between the middle domain of 
eIF4G and eIF3 (4-6). Rigid-body fitting of 
a crystal structure of a partial e[F4G-eIF4A 
complex from yeast (31) shows close agree- 
ment with the density (fig. S15), thus identify- 
ing the position of eIF4F in the 48S, consistent 
with previous biochemical data (4, 5, 32). It 
is likely that other domains of eIF4G, not visi- 
ble in this work because of possible flexibil- 
ity, could also make interactions with nearby 


elF4A-CTD 


elF3] 


Fig. 4. Interactions between elF4F and elF3 octameric structural core. 

(A and B) Rigid-body fitting (correlation = 0.92) of human homology model 

of elF4A/elF4G-HEATI into a cryo-EM map filtered to local resolution (6 to 11 A). 
(C) elF4A binds to a pocket formed by elF3l and elF3e. (D) Saturation 
binding curves showing the fraction of elF4A bound to elF3 in the presence 


Fig. 5. Blind spot for start-site recognition. RelE 1 10 
cleavage assay (materials and methods) shows 
a blind spot for start-site selection for mRNAs with 
a start codon less than 30 to 40 nucleotides 
from the 5' end. All five mRNAs tested have an 
AUG at position 50 just beyond the blind spot 
and a second AUG at positions downstream 

(60 nucleotides) or upstream (40, 30, 19, and 

10 nucleotides) of it. The main site of cleavage 

is at AUG 50 in all cases, showing that initiation 
occurs primarily at the first start codon 
downstream of the blind spot, and there is 

little or no cleavage at start codons for 10 to 

40 nucleotides. i} 
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or absence of elF4G. Error bars indicate SEM. (E) Surface representation of 
the mRNA (sugar-phosphate backbone) to highlight the path in the 48S. 

Blue dots represent a tentative path for the mRNA from the exit site (position 
14 from the P site) toward elF4A-NTD. The tentative path is based on 

weak unassigned density (fig. S16). 
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subunits such as the eIF3d-NTT and elF3e, 
which has been previously suggested by bio- 
chemical cross-linking (4). 

eIF4A binds to eIF3k and -l and eIF3e near 
the ES6S (Fig. 4, A and C), which agrees with 
recent cross-linking and immuno-EM data 
(13). To further validate the interaction between 
eIF4A and eIF3, we determined the binding 
affinity between these factors by fluorescence 
anisotropy. Consistent with our structure, 
eIF4A alone binds to eIF3, and its affinity is 
increased in the presence of eIF4G (Fig. 4D 
and table S3). We also used cross-linking mass 
spectrometry (XL-MS) to identify interactions 
between eIF4F and eIF3 in the absence of the 
48S (fig. S15). These data are consistent with 
the interactions observed in our structure. 

Our map contains unassigned density adja- 
cent to the eIF4G-eIF4A density, in close con- 
tact with eIF3k and -I (fig. S17). Although we 
cannot unambiguously assign it because of 
the low local resolution, the size and shape of 
the density are consistent with that of eIF4E. 
This location of eIF4E agrees well with our 
XL-MS data, which show that eIF4E interacts 
with eIF3k and -1 (fig. S17), as well as previous 
proximity-labeling (BioID) data, indicating 
that eIF4E and elIF3l are in close proximity 
in live cells (33). 

In our structure, eIF4F interacts with the 
43S PIC entirely through subunits of eIlF3 
that are not present in yeast (eIF3e, -k, and -1). 
The interaction we observe between eIF4A 
and eIF3k and -] is particularly surprising 
given that these individual eIF3 subunits 
are dispensable in Neurospora crassa and 
Caenorhabditis elegans (34, 35). In addition to 
a likely interaction between elF3e and elIF4G, 
our structure together with biochemical and 
genetic evidence indicates that substantial 
redundancy is likely to exist between these 
interactions. It will, therefore, be important 
to test this possibility in the future by using 
double eIF3 subunit knockouts in different 
organisms. The structure suggests that the 
interactions that e[F4F makes with other com- 
ponents of the 48S are likely to differ great- 
ly between species, the molecular basis of 
which will be important to solve with future 
structures. 

The cap-binding complex is positioned at 
the 5’ end of mRNA relative to the 40S subunit 
(Fig. 4). Translation complex profile sequenc- 
ing data indicated that the scanning 48S has 
5'-extended footprints upstream but not down- 
stream of the 40S (36, 37), which is consistent 
with the position of eIF4F in our structure. 
Nevertheless, we cannot rule out that other 
conformations of eIF4A may exist during its 
ATPase cycle and movement along mRNA. 


A blind spot in the mRNA 


In our structure, the location of eIF4F up- 
stream (to the 5’ end) of the 43S complex is 
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Fig. 6. Model for mRNA scanning during canonical translational initiation suggested by the structure. 


The elF4F at the m/G cap at the 5’ end of mRNA (A) 


recruits the 43S complex of the 40S subunit with 


initiation factors and initiator tRNAM* (B) to form the 48S complex (C). elF4F binds to the elF3 structural 
core, which places elF4E 30 to 40 nucleotides upstream of the P site of the 40S ribosomal subunit. During 
scanning, the mRNA is pulled through the 40S subunit [indicated by arrow in (C)], until the start codon is reached 
(D and E). In two alternative scenarios, elF4F could dissociate from the cap during scanning (D), or it could 
stay bound resulting in the mRNA forming a loop (E). Although not part of the structure in this work, the mRNA is 


shown with a poly(A) tail and a PABP interacting with e 


most consistent with mRNA being recruited 
to the 40S subunit by a slotting mechanism. 
Direct slotting of mRNA would also be com- 
patible with translation of circular mRNAs 
(38) as well as initiation on mRNAs contain- 
ing an internal ribosome entry site. Even dur- 
ing canonical initiation, mRNA is thought to 
be circularized by the polyadenylate [poly(A)]- 
binding protein (PABP) interacting with the 
poly(A) tail at the 3’ end and eIF4F (through 
eIF4G) at the 5’ end (39), thus favoring a slot- 
ting model for mRNA loading into the 40S. 
Such slotting would require rearrangements 
in some elements of eIF3 to make the mRNA 
channel accessible initially; however, eIF3 
is known to be dynamic, with various parts 
of its structure becoming ordered in differ- 
ent states, and in other contexts, the small 
subunit is known to close upon mRNA bind- 
ing (14). 
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FAF to reflect the situation in vivo (39). 


A slotting model of mRNA recruitment 
would result in a “blind spot” that would pre- 
clude recognition of start sites upstream of 
the location of the P site at the point of re- 
cruitment, which would be at least 30 nu- 
cleotides from the 5’ end on the basis of our 
structure (Fig. 4E). We tested for a blind spot 
in our RelE assay using a series of mRNAs 
that have a fixed start site at 50 nucleotides 
from the m’G cap, which would be down- 
stream of the blind spot, and an additional 
start site located either upstream (10, 19, 30, 
and 40 nucleotides) or downstream (60 nu- 
cleotides). Our data show that in either case, 
efficient initiation primarily occurs on the 
first start site that is encountered beyond the 
blind spot, namely at 50 nucleotides from 
the m’G cap (Fig. 5). We do observe some ini- 
tiation at a distance of 40 nucleotides from 
the m’G cap but very little or none at distances 
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of 30, 19, or 10 nucleotides. This blind spot is 
consistent with the fact that leaky scanning 
is observed on mRNAs that contain 5’ un- 
translated regions (5'UTRs) of less than 32 
nucleotides (40). We thus propose that a 
blind spot of between 30 and 40 nucleotides 
exists on human mRNAs, which would be 
compatible with the typical 5’UTR of mRNA in 
humans, whose median length is 218 nucleo- 
tides (41). 


A model for recruitment and scanning 


The structural and biochemical data suggest a 
model in which mRNA is slotted into the 40S 
just downstream of eIF4F during recruitment 
(Fig. 6). The location of eIF4A upstream of the 
40S subunit (Fig. 4) suggests that the mRNA 
is likely to be pulled through the channel in 
the 40S subunit during scanning. The model 
is economic in terms of rearrangements: Once 
the 43S PIC is recruited to the cap-binding 
complex, the entire 48S initiation complex es- 
sentially stays intact while the mRNA is pulled 
through the 40S subunit, until the start codon 
reaches the P site of the ribosome and triggers 
subsequent steps in initiation. 

The hydrolysis of ATP by eIF4A is required 
for efficient scanning (Fig. 1A), but precisely 
how ATP hydrolysis promotes scanning is not 
clear. It is possible that eIF4F could work as a 
Brownian ratchet (42) in which the 40S sub- 
unit would slide along the mRNA in a stochastic 
manner, but eIF4F would move unidirection- 
ally along the mRNA to keep up with the 40S 
in an ATP-dependent manner and act as a pawl 
or backstop to prevent reverse movement of 
the 40S subunit. In this model, the ATP would 
ensure unidirectionality of scanning, but the 
energy to melt secondary structure would 
come just from thermal fluctuations and the 
40S interaction, consistent with the finding 
that eIF4F alone is not processive (43, 44). 
Ribosome proteins uS3, uS4, and uS5 located 
in the entry channel of the small ribosomal 
subunit help to unwind mRNA secondary 
structure during translocation (45). It is, there- 
fore, possible that eIF4A may be exploiting an 
intrinsic property of the 40S subunit in a sim- 
ilar way during scanning. This model leaves 
the mRNA entry site free to bind factors known 
to facilitate translation of mRNAs with ex- 
tended secondary structure, such as DHX29, 
which has been observed at the entry site (10). 
Because there is a second eIF4A binding site 
on eIF4G, and moreover, eIF4A is present in 
excess over other components of eIF4F and 
is known to have additional roles in melting 
RNA secondary structure (46), it is possible 
that one or more additional eIF4A molecules 
could also play a role downstream of the 48S 
to facilitate translation. Therefore, pulling of 
the mRNA through the 40S may be only one 
part of a complex mechanism of unwinding 
of mRNA secondary structure and scanning. 
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The role of eIF4B also remains unclear and 
may be part of this process. 

eIF4E stimulates the helicase activity of 
elF4A (47) and thus likely remains bound to 
the rest of eIF4F throughout the process (37). 
From our structure, it is not clear whether 
the 5’ end of mRNA is released from eIF4E and 
progressively moves farther away from the 48S 
PIC during scanning (Fig. 6D). Although the 
dissociation rate of isolated eIF4E from the 
mG cap is quite high (48), its interaction with 
the rest of eIF4F could enable efficient and 
rapid rebinding of eIF4E to the 5’ cap (fig. S1, 
B and C) (49). This would require the 5'UTR 
mRNA to loop out as it is pulled through the 
40S subunit during scanning (Fig. 6E), a pos- 
sibility also previously suggested (3). Recent 
evidence to support this model has been pro- 
vided by 40S selective ribosome footprinting, 
which indicates that the scanning 48S complex 
remains tethered to the m’G cap throughout 
the scanning process in human cells (37). 

Some rare mRNAs with unusually short 
5'UTRs would place the recruited 43S PIC 
beyond the initiation codon (6). We propose 
that these types of mRNAs would likely ex- 
ploit an alternative recruitment pathway, per- 
haps occurring in the absence of eIF1 (6). Such 
mRNAs could possibly require a different role 
of eIF4F or even be independent of it. Previous 
work showing elF4F-dependent translation 
of mRNAs with very short 5‘UTRs propose 
that eIF4F binds at the entry rather than exit 
site, followed by dissociation of eIF4E from 
the cap and threading of mRNA into the 40S 
subunit through its decoding site (6). For such 
a model to be compatible with our structure, 
the eIF4F complex would need to relocate to the 
opposite side of the 40S at some stage, which we 
consider unlikely. To resolve these discrepan- 
cies, it will be important to determine structures 
of 48S complexes with other mRNAs, including 
those with a short 5’'UTR. 


Outlook 


This work reveals the structure of an essen- 
tially complete 43S PIC (in the context of 48S 
PIC) at high resolution as well as its inter- 
actions with the cap-binding complex at the 
5’ end of mRNA (movie S1). The structure sheds 
light on several important but hitherto un- 
resolved aspects of initiation, including the 
mechanism of mRNA recruitment to the 43S 
PIC and how the position of eIF4F at the mRNA 
exit channel likely facilitates the process of 
scanning. 
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PROTEIN DESIGN 


A defined structural unit enables de novo design of 
small-molecule—-binding proteins 


Nicholas F. Polizzi* and William F. DeGrado* 


The de novo design of proteins that bind highly functionalized small molecules represents a great challenge. 
To enable computational design of binders, we developed a unit of protein structure—a van der Mer (vdM)—that 
maps the backbone of each amino acid to statistically preferred positions of interacting chemical groups. 
Using vdMs, we designed six de novo proteins to bind the drug apixaban; two bound with low and 
submicromolar affinity. X-ray crystallography and mutagenesis confirmed a structure with a precisely 
designed cavity that forms favorable interactions in the drug—protein complex. vdMs may enable design 
of functional proteins for applications in sensing, medicine, and catalysis. 


he Anfinsen hypothesis states that a 

protein’s sequence encodes its tertiary 

structure and underlying function (J). 

Conversely, a protein’s tertiary structure 

encodes the possible sequences compa- 
tible with a particular function. De novo pro- 
tein design has succeeded in the creation of 
proteins that fold to various targeted tertiary 
structures (structure to sequence) (2, 3). Never- 
theless, it has been extremely challenging to 
design proteins that not only fold but also 
bind to complex small molecules (function 
and structure to sequence) (2-4). Use of algo- 
rithms optimized for packing apolar protein 
cores leads to difficulty when designing po- 
lar cavities required for binding hydrophilic 
molecules (5). Consequently, design of small- 
molecule-binding proteins has generally re- 
quired recursive experimental screening and 
large libraries to engender function, mostly 
starting with natural proteins rather than de 
novo structures (Fig. 1A) (3, 4, 6-9). Here, we 
accomplish the reverse of the Anfinsen hy- 
pothesis by simultaneously designing struc- 
ture and binding function from scratch, 
targeting a small-molecule drug with signifi- 
cant polarity and structural complexity. To 
do this, we developed a unit of local protein 
structure that directly links a tertiary struc- 
ture to key interactions that engender tight 
and specific binding. These findings illuminate 
the principles underlying the emergence and 
evolution of complex function in proteins and 
provide a methodology for designing useful 
proteins. 


Targeted function and fold 


We targeted the factor Xa inhibitor apixaban, 
an organic compound with five rotatable bonds 
and eight heteroatoms. Our first objective 
was to compute a tertiary structure capable 
of cooperatively binding the polar groups of 
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apixaban. Instead of repurposing natural bind- 
ing proteins or folds that have been shown 
to bind a similar ligand, in this work, we use 
de novo four-helix bundles because they are 
mathematically parameterized (10, 11), des- 
ignable (12), and share no similarity to the 
fold of factor Xa. Four-helix bundles gener- 
ally do not bind small molecules and instead 
bind metal ions or metalloporphyrins by strong 
coordinate bonds (J0, 13-16). However, four- 
helix bundles are tubular and can be designed 
to have high thermodynamic stability (11, 13) 
to compensate for the energetically demand- 
ing process of building binding cavities re- 
plete with buried polar functionality (77). Thus, 
the design of a de novo helical bundle that binds 
the drug apixaban critically tests the design 
method. 


The van der Mer structural unit 


The design of proteins relies on optimal pack- 
ing of interior side chains in discrete confor- 
mations called rotamers (2, 3, 18-22). However, 
the design of ligand-binding proteins addi- 
tionally requires side chains that interact 
favorably with the target small molecule. Pre- 
vious design strategies approached this prob- 
lem by computationally appending the target 
ligand to rotamers with idealized interaction 
geometries that—although composed of bil- 
lions of conformations—sampled only a small 
fraction of the possible conformational space 
(6, 8, 23). These strategies rarely deliver sub- 
millimolar binders from the initial compu- 
tational design, so subsequent steps rely on 
experimental random mutagenesis and screen- 
ing of libraries. 

We wondered how much of the vast, possi- 
ble conformational space of protein-chemical 
group interactions is actually sampled in ob- 
served protein structures and if sampling in- 
teractions directly from this distribution might 
aid the design of high-affinity binders. Whereas 
previous analyses have focused on local side 
chain contacts with chemical groups (24), we 
sought a structural unit that directly maps 
backbone coordinates to chemical group loca- 
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tions, the link between the protein fold and 
binding function. We developed a unit of pro- 
tein structure analogous to rotamers—a van der 
Mer (vdM)—that defines the placement of key 
chemical groups in the ligand relative to the 
backbone atoms of the contacting residue (Fig. 
1B). vdMs are culled from a nonredundant set 
of protein structures by (i) identifying all 
residues of a certain type that interact with 
a particular chemical group, (ii) performing 
an all-by-all pairwise superposition of only 
the backbone and chemical group coordi- 
nates (side chains are not considered in the 
superposition, allowing some variation in 
their conformation), and (iii) geometric clus- 
tering with a tight root mean square deviation 
(RMSD) cutoff (0.5 A). The resulting vdMs 
show backbone ¢ and y dependence (Fig. 1C) 
and capture compensatory effects of backbone 
and chemical group placement. Furthermore, 
single clusters may contain multiple rotamers 
(Figs. 1D and 6A and fig. S1), given that side- 
chain coordinates are not explicitly considered 
in clustering. 

The use of vdMs contrasts with procedures 
that place ligands at idealized locations rela- 
tive to the terminal atoms of a side chain 
(6, 8, 23, 25), which results in vast numbers 
of ligand-rotamer combinations that might 
never occur in proteins. Instead, vdMs sam- 
ple locations of chemical groups relative to 
the backbone that have been experimental- 
ly vetted to achieve favorable interactions. 
They also implicitly consider interactions 
with ordered or bulk water, which might in- 
fluence their interaction geometries. More- 
over, unlike ligand-appended and inverse 
rotamers used in earlier approaches (6, 8, 23, 25), 
vdMs may be derived from contacts with 
either main chain, side chain, or both in a 
multivalent interaction. Finally, the preva- 
lence of a given vdM in the Protein Data Bank 
(PDB) can be used in scoring functions, sim- 
ilarly to scoring rotamers, which may assist 
automated selection of binding-site residues 
for design. 

To maximize the number of observed 
protein-chemical group contacts, we created 
vdMs using the chemical groups of amino 
acids that constitute the protein (e.g., CONH» 
of Gln and Asn and N-H and C=O of back- 
bone amides). To avoid bias from local struc- 
ture, we counted only the interactions that 
were distant in the linear polypeptide chain, as 
described in the supplementary materials. The 
set of chemical groups can also be expanded 
to include those from small-molecule drugs, 
metal ions, and cofactors, although these are 
not as pervasive in crystal structures. 

We ranked vdMs by their prevalence in the 
PDB using a log-odds score, C (Fig. 1, D and 
E; fig. S2; and supplementary text). Although 
there are hundreds of vdMs associated with 
a given residue-chemical group combination 
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group position to the protein backbone. (A) Workflow 
of COMBS. (B) Definition of a vdM. A chemical 


group is interacting if it is in van der Waals contact with the protein side chain or main chain. Like 
rotamers, vdMs are derived from a large set of high-quality protein crystal structures. A vdM of aspartic 
acid (Asp) and carboxamide (CONH», cyan) is shown. (C) vdMs are ~, w, and rotamer dependent; this 
is illustrated by the top vdMs of the m-30 rotamer of Asp, clustered by location of CONH> after exact 
superposition of main chain N, Ca, and C atoms. (D and E) We ranked vdMs by prevalence in the PDB, 
quantified by a cluster score C [the natural logarithm of the ratio of the number of members in a cluster 
(Nvam) to the average number of members in a cluster ((Nvam))]. The seventh-largest cluster of 


Asp/CONH> vdMs is shown as an example in (D). 


(figs. S3 and S4), only a small fraction of 
vdMs are highly enriched in protein struc- 
tures (C > 0). For example, only 91 Asp CONH, 
vdMs have C values >0; these top vdMs map 
the locations of CONHg, relative to the back- 
bone of an Asp residue, that are statistically 
preferred by proteins in the PDB (Fig. 1E and 
fig. S2D). This is on the order of the number 
of rotamers used for an amino acid during 
a typical protein design packing calcula- 
tion (26). Thus, when combined with an ef- 
ficient search algorithm, sampling protein- 
chemical group interactions with vdMs to 
design ligand-binding sites might be as ex- 
pedient as sampling rotamers to pack a pro- 
tein core. Furthermore, functionally relevant 
lower-probability rotamers may be included 
if contained in a high-scoring vdM. 
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Proteins use the same set of 20 amino acids 
to fold as well as to recognize a vast array of 
highly functionalized ligands. We therefore 
hypothesized that the interaction modes used 
by amino acids to stabilize their tertiary struc- 
tures would also be used to achieve tight 
binding of ligands, even those containing 
structurally distinct heterocyclic chemical 
groups. To test this hypothesis, we examined 
the streptavidin-biotin complex (Fig. 2). Using 
the natural sequence of streptavidin, we ex- 
amined the positions of vdMs of N-H, C=O, 
and COO’, where these groups were derived 
from protein main chains and side chains. In 
each case, we observed that the side-chain in- 
teractions with biotin’s polar groups involved 
highly favorable vdMs, with enrichment scores 
of ~8-fold or greater (C > 2). The streptavidin 
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sequence-fold pairing cooperatively positions 
highly favorable vdMs to cover each polar 
chemical group of biotin simultaneously. 

Our analysis of the streptavidin-biotin com- 
plex suggests that binding sites can be de- 
signed by considering folds that position vaMs 
to collectively bind the distinct chemical groups 
found in a target small-molecule ligand. More- 
over, the vdMs of the binding site should be 
maximally prevalent in the PDB. We devel- 
oped a search algorithm, called Convergent 
Motifs for Binding Sites (COMBS), to dis- 
cover favorable poses of a ligand that satisfy 
these criteria. 


De novo design strategy 


Our design strategy consists of several hier- 
archic steps, which prioritize the most es- 
sential and difficult features to avoid sampling 
regions in sequence and structure space with 
little chance of success (fig. S5). First, we de- 
fine the chemical groups within the small 
molecule that will be targeted. We initially 
focus on polar chemical groups, which are 
the most challenging to dehydrate but must 
be satisfied with H-bonds to achieve high af- 
finity and specificity (27). Second, we choose 
a designable protein fold and create an en- 
semble of backbones with geometries that 
are consistent with the known plasticity of 
the fold. Next, for each backbone we use 
COMBS to identify members of the backbone 
ensemble that can position vdMs to collec- 
tively engage each of the targeted chemical 
groups of the small molecule. In this way, 
the binding of the desired ligand dictates the 
precise backbone geometry. Having discovered 
candidate backbones and binding sites, the 
design is completed by engineering a tightly 
packed folding core that supports the vdM- 
derived keystone interactions in the binding 
site (73). In this step, we constrain the key- 
stone interactions and use flexible backbone 
design (13, 26) to pack additional residues 
within the binding site while simultaneously 
packing the protein core. 

We focused on apixaban’s carboxamide (both 
the C=O and -NHg), as well as two additional 
carbonyls (Fig. 3A). (Other groups that were 
internally H-bonded or easily dehydrated were 
not initially targeted.) We created a set of 
vdMs of carboxamide (CONH, from Asn and 
Gln side chains) and carbonyl [C=O from the 
protein backbone (supplementary text)] and 
used these vdMs to discover preferred CONH, 
and C=O binding locations within a set of 32 
mathematically generated de novo polyglycine 
backbones (10, 28) (Fig. 3, B and C; fig. $2; 
and table S1). For each of the mathematically 
generated backbones, we placed apixaban in 
the protein interior by using a separate set 
of vdMs with apixaban superimposed onto 
the chemical group of the vdM. For example, 
the CONH, of apixaban can be superimposed 
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on the CONH, of a vdM, uniquely defining 
the position of apixaban in the binding site. 
Apixaban’s conformation in this step was fixed 
in alow-energy conformer found in its cocrystal 
structure with factor Xa (PDB code 2p16) (Fig. 
3A and fig. S6; extension to multiple con- 
formers is discussed in the supplementary text 
and illustrated in fig. $7). vdMs that cover the 
remaining C=O groups of the placed ligand, as 
well as additional vdMs to the carboxamide, 
were then queried in the nearby space (Fig. 
3D). We chose binding poses by maximizing 
the PDB prevalence of sterically compatible 
vdMs (°C) (Fig. 3E). 

Side chains from vdMs in six selected bind- 
ing poses were fixed, and their H-bonding in- 
teractions with apixaban were constrained in 
all subsequent steps of sequence design per- 
formed within the Rosetta modeling suite. 
After insertion of interhelical loops, we used 
a flexible backbone design protocol (73) (Fig. 
3F) to compute the hydrophobic core while 
simultaneously completing the packing of the 
binding site. For some designs, new polar in- 
teractions were recruited during this step, as 
were Gly residues, which are known to interact 
favorably with aromatic groups (27). The use 
of small residues to make hydrophobic con- 
tacts minimizes the number of large, apolar 
side chains that might lead to nonspecific 
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Possible COO- locations 
based on proteins in PDB 


vdMs capture biotin binding mode 


binding or hydrophobic collapse in the absence 
of ligand. 


Description of designs and 
biophysical characterization 


We designed six proteins of varying length, 
topology, ligand position, ligand burial, and 
keystone interactions (fig. S8). By contrast 
to factor Xa, which engages polar groups of 
apixaban through main-chain amides in loops 
(fig. S6), the designs interact with apixaban 
using predominantly side chains in helices. 
The six designs were well-expressed in bac- 
teria, and each was helical based on far UV 
circular dichroism spectroscopy (fig. S9). Pro- 
ton nuclear magnetic resonance (NMR) showed 
that two designs, ABLE (apixaban-binding he- 
lical bundle) and LABLE (longer ABLE), bound 
apixaban (fig. S10). These two designs had the 
same orientation of apixaban within the bun- 
dle and shared the same vdM-derived key- 
stone interactions (Fig. 3E and fig. S8). For 
example, they shared a buried, high-scoring 
His/C=O vdM (8-fold enrichment, C = 2.1) 
(Fig. 3E). However, ABLE and LABLE differed 
in length (125 vs. 165 residues), topology, and 
loop geometry and shared only 22% sequence 
homology. 

Binding of apixaban to ABLE restricts the 
drug’s conformation, resulting in a red shift of 


4 September 2020 


Fig. 2. Prevalent vdMs describe the binding 
site of biotin in streptavidin. (A and B) We 
constructed vdMs of the polar chemical groups 
of biotin by searching the PDB for protein 
interactions with the (i) backbone amide nitrogen 
(N-H), (ii) backbone carbonyl or carbonyl from 
Asn or Gln side chains (C=O), and (iii) carboxylate 
of Asp or Glu side chains (COO ). (C) Using 

the native sequence of streptavidin, vdMs were 
sampled on the streptavidin backbone to 
generate possible locations for productive 
interactions with the chemical groups. Here, 

Asn and Ser vdMs of COO” are sampled at 

two positions of the backbone. (D) vdMs with 
chemical groups (cyan) that are nearest 
neighbors (0.6 A RMSD) to those of biotin in 

its binding site are overlaid on top 

of biotin (purple). 


its electronic absorbance spectrum (Fig. 3G). 
Spectral titrations and fluorescence polariza- 
tion competition experiments showed that 
ABLE and LABLE bind apixaban with a dis- 
sociation constant (Kp) of 5 (+ 1 [SEM]) nM 
and 0.6 (+ 0.1) uM, respectively (Figs. 3H 
and 4D; figs. S11 and S12). Although LABLE 
showed a dispersed two-dimensional ‘H-°N 
heteronuclear single-quantum coherence spec- 
trum by NMR (fig. S13), indicative of a well- 
structured protein, it failed to crystallize in a 
sparse matrix screen, and so we focused our 
attention on characterization of ABLE. ABLE 
is monomeric in solution (fig. $14) and highly 
stable to heat denaturation (melting temper- 
ature of >95°C), despite the inclusion of three 
Gly and a polar His within its core (fig. S15). 


Structures of apixaban-bound and drug-free ABLE 


ABLE readily crystalized with apixaban and 
diffracted to 1.3-A resolution. Two very closely 
related monomers were observed in the asym- 
metric unit (fig. $16); apixaban is bound to 
both monomers, as expected for a specific, 
high-affinity complex. The structure of the 
drug-bound protein is in excellent agreement 
with the design (Ca RMSD of 0.7 A) (Fig. 4). 
The rotamers of the core residues of ABLE, 
including the binding-site residues, over- 
whelmingly agree with the design model. 
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Superimposing by all heavy atoms of core 
amino acids, including apixaban, gives an 
RMSD of 0.98 A. ABLE buries almost all avail- 
able apolar surface area (504 A”) of apixaban, 


and it also forms most polar interactions in- 
cluded in the design (Fig. 4, B and C). Apixaban’s 
conformation is close to that used in the design 
(0.6 A heavy atom RMSD), with small devia- 
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Fig. 3. Apixaban-binding helical bundle (ABLE) design strategy. (A to F) Steps of the design process. 


(A) We targeted simultaneous engagement of two carbonyls (C=O) and the carboxamide (CONH») of apixaban. 


(B) We computationally generated a set of 32 designable four-helix bundle folds based on a mathematical 


parameterization. (C) vdM sampling of CONH2 and C=O allowed us to enumerate statistically preferred locations 


of these chemical groups relative to the backbone. (D) We used a precomputed set of vdMs with apixaban 


superimposed by one of its chemical groups to position apixaban within the bund 
to have at least one vdM that accommodates its position. Chemical groups of vd 


le, such that it was guaranteed 
s that overlap with those 


of apixaban are found by a nearest-neighbors lookup. Multiple vdMs contributing from one residue position are 


possible, e.g., His/C=O and Trp/C=0 vdMs, and can be used in separate designs. ( 
each chemical group of the ligand were made by maximizing the use of highly en 


site (high C score) (Fig. 1, D and E). Final ligand positions and interactions for the si 


E) Specific choices of vdMs for 
riched vdMs in the binding 
Xx experimentally characterized 


designs were chosen by maximizing both C and the burial of the apolar surface area of apixaban. The vdMs 
chosen to comprise the binding site of ABLE are shown along with their cluster scores. (F) The location of 


apixaban and its vdM-derived interactions with the protein are constrained in a subsequent flexible backbone 
sequence design protocol. (G) The electronic absorbance spectrum of apixaban is red-shifted upon binding to 
ABLE. The black spectrum shows apixaban (4 uM) in buffer containing 50 mM NaPi, 100 mM NaCI (pH 7.4). 
The red spectrum is the difference of the absorbance spectrum of ABLE alone (20 uM) and the spectrum of 
ABLE (20 uM) with apixaban (4 uM). The spectra were normalized to the peak maximum for comparison. 

These experiments were facilitated by the high extinction coefficient of apixaban and the lack of Trp in ABLE. 


(H) Global fit of a single-site binding model to the absorbance changes at 305 nm 


upon titration of apixaban into 


5, 10, and 20 uM solutions of ABLE. The Kp from the fit is 5 (+ 1) uM, which was confirmed by fluorescence 


polarization competition experiments (supplementary materials). 
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tions that bring it closer to a quantum me- 
chanically optimized geometry (fig. S17). The 
rigid body translation between apixaban’s 
center of mass in the designed versus that in 
observed structures is only 0.2 A, with a rigid 
body rotation of 6°. The bespoke binding site is 
specific for apixaban, as shown by fluorescence 
polarization competition experiments (Fig. 4D), 
which indicated that ABLE binds apixaban 
20-fold more tightly than a similar factor Xa 
inhibitor, rivaroxaban. 

To assess the extent of preorganization of 
the protein, we also solved the drug-free struc- 
ture to 1.3-A resolution (Fig. 5). The structure 
shows an open, preorganized binding pocket, 
with an overall Ca RMSD of 0.65 A to the 
apixaban-ABLE complex. The unoccupied 
binding site is solvated by nine ordered water 
molecules plus an acetate from the buffer 
(Fig. 5D). Binding of apixaban displaces 
ordered solvent from this site, suggesting a 
release of local frustration upon binding. The 
pocket has a 480-A” solvent-exposed surface 
area, which expands by 40% to accommo- 
date the drug (680 A’). The drug-free protein 
has nearly identical rotamers to that of the 
drug-bound protein throughout the core and 
binding site (Fig. 5, G to I). Unliganded ABLE 
shows two alternate rotamers for several of 
the residues that form H-bonds to apixaban 
(e.g., Tyr*® and His*®); binding of apixaban 
selects one each of these alternate rotamers. 
Thus, like many natural proteins (29), ABLE 
has a limited degree of flexibility, which is 
reduced upon ligand binding, and the binding 
event appears to trade configurational entropy 
for enthalpically favorable interactions. 


Insights from the structure and function of ABLE 


Two of the three keystone interactions iden- 
tified by COMBS contribute appreciably to 
binding affinity. Substitution of His*® or Gln 
with alanine individually decreases affinity 
by approximately 1 kcal/mol (~3-fold) (Fig. 
6D and fig. $18). Gln was observed in its 
intended rotamer, whereas His*® occupied an 
alternate rotamer that nevertheless main- 
tained the intended position of apixaban’s 
carbonyl relative to the main chain (Fig. 6A). 
Indeed, the cluster describing this His/C=O 
vdM contains multiple His rotamers, each 
capable of achieving identical placements of 
C=O relative to the main chain. Thus, we ob- 
served vdM convergence, even amidst rotamer 
divergence. 

We also examined the structural conse- 
quences of substituting His*® with Ala by 
solving the crystal structure of the unliganded 
His*°—Ala (H49A) mutant protein (fig. $19). 
Although the structures of drug-free ABLE and 
drug-free H49A are similar (Co. RMSD = 1.2 A), 
the residues that surround His*® show rota- 
meric differences in the absence of this side 
chain; released from the restraints of tight 
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Fig. 4. The structure of apixaban-bound ABLE agrees 
with the design. (A) Superposition of backbone Ca atoms 
of structure (protein in orange, apixaban in purple) and 
design (gray; 0.7 A RMSD), showing side chains of amino 
acids in the protein core. (B) ABLE’s binding site from the 
structure (1.3-A resolution), showing vdM-derived interac- 
tions with apixaban (purple). The 2mFo-DFc composite omit 
map is contoured at 1.5 o. The map was generated from a 
model that omitted coordinates of apixaban. The protein 
backbone of these residues is shown in cartoon format. 
(C) Overlay of designed interactions (gray), after the 
designed model was superimposed onto the Ca atoms of 


Gint4' 

drug-bound ABLE 
vs 

design = Thr112 


the structure (protein in orange, apixaban in purple). 
(D) Fluorescence anisotropy competition experiments 


FITC) (supplementary text and fig. S9) is dislodged by 


the fraction bound by use of a one-site binding model 


(supplementary text). The ABLE concentration was 20 uM, 


(485-nm excitation, 528-nm emission) showed that ABLE 
binds apixaban specifically. The bound fluorophore apixaban— 
polyethylene glycol-fluorescein isothiocyanate (apixaban-PEG- 


addition of competing ligand. Anisotropy was converted to 


Fraction bound 


and the apixaban-PEG-FITC concentration was 25 nM in buffer 


containing 50 mM NaPi, 100 mM NaCl (pH 7.4). Apixaban 


COO’ is identical to apixaban except that it contains a 
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carboxylate instead of a carboxamide (circled). Rivaroxaban is another inhibitor that also binds tightly to factor Xa by using the same binding mode as apixaban but shows only 
very weak binding to ABLE. Fits to a competitive binding model are shown in red. Kp values: rivaroxaban, 130 (+ 10) uM; apixaban COO”, 50 (+ 5) uM; apixaban, 7 (+ 2) uM. 


Fig. 5. Drug-free ABLE has a preorganized structure with an open A 
binding site competent for binding. (A) A slice through a surface 
representation of the 1.3-A resolution structure of unliganded ABLE shows 
an open binding cavity. (B) Same slice, shown for the structure of binding 
apixaban-bound ABLE. (€) The Co. atom backbone superposition of 
unliganded and liganded ABLE. Colored squares surrounding the structure 
correspond to panels in (G), (H), and (1), looking down from the top. 

(D) The binding site of drug-free ABLE shows nine buried, crystallographic 
waters (red spheres, occupancy > 0.9) involved in an extensive H-bonded 
network with binding-site residues Tyr®, Gin’, Tyr“®, and His*®. The 
2mFo-DFc electron density map of drug-free ABLE is contoured at 1 o. 
An acetate (Act) group from the crystallization condition H-bonds 

with His*®. His’? and Tyr“° are observed with alternate rotamers. (E) Same 
view as in (D) but with the addition of the corresponding residues 
from the apixaban-bound structure, after an all-Co-atom backbone 
superposition. The 1-o 2mFo-DFc electron density (purple) of apixaban 
from the drug-bound structure shows where the crystallographic 

waters bind in the ligand-free structure relative to the bound structure. 
A water (shown as an orange sphere) mediates the H-bond between Tyr“® 


"? 


drug-free E 
to 
electron 


binding pocket 


and apixaban. This water is not observed in the unliganded structure. 


(F) Binding of apixaban in the drug-bound structure displaces all of the 
nine buried waters in the drug-free structure. Stick renderings, as well as 
the surface background, show the binding site of the ABLE-apixaban 
complex. (G and H) Binding-site overlay of liganded (orange, apixaban 
purple) and unliganded (cyan) ABLE shows preorganized rotamers. 

(I) The remote folding core contains identical rotamers in drug-free and 
drug-bound ABLE, predisposing the drug-free protein for binding. 


packing, they instead adopt their preferred 
rotamers. The structure illustrates that glo- 
bal packing of core residues supports the 
positioning of a key functional group, even 
when this requires local frustration at indi- 
vidual sites. 
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Substitution of the third keystone residue, 
Thr'”, with Ala resulted in little change in 
affinity (Fig. 6D). In the complex, its side 
chain did not form the intended H-bond to 
apixaban but instead formed an intrahelical 
H-bond to a backbone carbonyl (Fig. 6C). 
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The intended Thr/C=O vdM is favored in the 
backbone-independent vdM library used in 
the design of ABLE, but it is disfavored in a 
backbone-dependent vdM library. The lack of 
engagement with apixaban’s carbonyl resulted 
in some disorder of the terminal oxopiperidine, 
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Fig. 6. Design inferences from the structure and function of 
ABLE. (A) Exact sidechain positioning is not necessary for 

precise placement of ligand chemical groups relative to 

the mainchain. The placement of the C=O chemical group of 
apixaban relative to the backbone of residue 49 is exact (0.25 A 
RMSD). The His*?/C=O vdM from the design (green) (Fig. 3E) was 
superimposed onto His“? (orange) of the drug-bound ABLE structure 
through use of backbone atoms (N, Ca, C atoms, spheres). This 
backbone superposition places the C=O group of the original vdM 
precisely (0.25 A RMSD) onto that of apixaban (purple) in the 
structure. The cluster describing the His C=O vdM, shown 

beneath, contains multiple rotamers of His that achieve the same 
placement of C=O relative to the position of the backbone. The 
rotamers of His“? in the structure and His from the original 

vdM are both observed in the cluster. (B) Flexible backbone sequence 
design (Fig. 3F) resulted in recruitment of two additional polar 
interactions with apixaban from Tyr® and Tyr*®. A Tyr°/CONH> vdM is 
prevalent in the PDB, whereas the Tyr*°/C=0 interaction is not 
found in the database. (C) A water mediates an H-bond between 
Tyr“® and the C=O group of apixaban. Thr!2* H-bonds the C=O of the 
helix backbone at residue 108. (D) Relative binding affinities of 
ABLE mutants with apixaban-PEG-FITC fluorophore by fluorescence 
anisotropy experiments (supplementary text and fig. S18). 
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which has higher b-factors and two alter- 
nate conformations (related by a 180° ring flip) 
in the structure (fig. S16). Thus, backbone- 
dependent vdM libraries should be used in 
future applications. 

Flexible backbone sequence design of ABLE 
recruited two Tyr residues that interact with 
apixaban (Fig. 6B). One of these interactions 
was represented in the vdM database (Tyr°/ 
CONH,, C = 0.4), but the other (Tyr*®/C=0) 
was not. The structure of drug-bound ABLE 
confirmed the H-bond of Tyr°/CONH) (Fig. 4C), 
but an unanticipated water enters the binding 
site to mediate an H-bond between apixaban 
and Tyr“ (Fig. 6C). Furthermore, substitution 
of Tyr° with Phe or Ala was more destabilizing 
than the same substitutions for Tyr“, tracking 
with prevalence in the PDB. Thus, vdMs can be 
used to filter and rank interactions obtained 
using a variety of computational methods (30). 

Finally, we wondered if ab initio folding 
predictions (26) might distinguish between 
successful versus unsuccessful designs. Of the 
six designs, only two—ABLE and LABLE—were 
predicted by folding simulations to maintain 
uncollapsed binding sites (fig. S20). More- 
over, the lowest-energy models predicted from 
ab initio folding simulations of ABLE’s se- 
quence largely agreed with the crystallographic 
structure (fig. S20A). Thus, ab initio folding may 
be useful as a screen to ensure that designs main- 
tain an open, preorganized site. These results 
emphasize the degree to which the folding and 
binding problems are intimately coupled. 


Conclusion 


Previously, the design of de novo proteins 
that bind in a shape-selective manner to rigid, 
flat, hydrophobic dyes or lipidic metabolites 
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had been possible, but binding flexible mole- 
cules replete with polar atoms has been more 
challenging (4, 8, 37-33). Natural proteins bind 
highly functionalized ligands by first accruing 
the ability to weakly bind fragments within 
the context of a particular fold (34-36). To 
mimic this process, we developed the vdM 
structural unit to directly link the protein fold 
to statistically preferred binding modes of 
chemical groups. We sampled vdMs on the 
backbone of a designable four-helix bundle to 
create constellations of chemical groups that, 
when matched with the shape of apixaban, 
defined the binding site. This contrasts with 
previous approaches that search for positional 
matching of whole ligands, sampled using ide- 
alized interaction geometries. Such approaches 
are highly sensitive to small changes in the 
interaction geometries and thus require an 
enormous amount of sampling to discover 
possible binding solutions, many of which may 
contain interactions not observed in the PDB. 

vdMs sample from the experimentally vetted 
distribution of observed protein structures. 
vdMs are surprisingly sparse and discrete (Fig. 
1E and figs. S3 and S4), and they enable facile 
sampling of sequence space to discover con- 
vergent combinations of keystone interactions 
(supplementary text and fig. S2). We consider 
only the backbone and the orientation of the 
pendant chemical group, which obviates the 
need to enumerate a large ensemble of ligand- 
appended rotamers for each amino acid type 
at each position of the sequence. We focused 
here on simple, fully de novo scaffolds rather 
than redesigning the specificity of natural 
ligand-binding proteins, because we wished 
to address the challenge of designing function 
entirely from scratch. Indeed, ABLE shares 
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no sequence homology to any known proteins 
(BLAST E value of <0.42 against the nonre- 
dundant protein sequence database nr). We 
used only prevalence to rank vdMs and choose 
binding sites, but we suspect the true power of 
vdMs may lie in higher-order correlations of 
the interactions. 

COMBS and vdMs can now be used for a 
variety of protein engineering applications and 
in full partnership with experimental optimi- 
zation strategies for exploring sequence space. 
We anticipate that vdMs can also be used to 
predict chemical group hot spots of proteins 
with fixed sequence. vdMs may also enable 
design of protein-protein interfaces in a self- 
consistent manner. Finally, because vdMs 
sample from the distribution of evolved inter- 
action geometries observed in protein structures, 
it is tempting to view the chemical group con- 
stellations constructed by vdMs as a structural 
hypothesis of the evolutionary path to acquire 
binding within the context of a given fold. 
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STELLAR ASTROPHYSICS 


A triple-star system with a misaligned and warped 
circumstellar disk shaped by disk tearing 
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Young stars are surrounded by a circumstellar disk of gas and dust, within which planet formation can 
occur. Gravitational forces in multiple star systems can disrupt the disk. Theoretical models predict 
that if the disk is misaligned with the orbital plane of the stars, the disk should warp and break into 
precessing rings, a phenomenon known as disk tearing. We present observations of the triple-star 
system GW Orionis, finding evidence for disk tearing. Our images show an eccentric ring that is 
misaligned with the orbital planes and the outer disk. The ring casts shadows on a strongly warped 
intermediate region of the disk. If planets can form within the warped disk, disk tearing could provide 
a mechanism for forming wide-separation planets on oblique orbits. 


tars form through the fragmentation 

and collapse of molecular clouds. The 

most frequent outcome of this process 

is a gravitationally bound multiple star 

system, such as a binary or triple (J, 2). 
As the system evolves, the stars interact dy- 
namically with each other and with the sur- 
rounding disk of gas and dust, which holds 
material that could either accrete onto the 
stars or form planets. Numerical simulations 
(3, 4) have predicted that a hydrodynamic 
effect, known as disk tearing, will occur in 
disks around multiple systems if the orbital 
plane of the stars is strongly misaligned with 
the disk plane. Gravitational torque from the 
stars is then predicted to break the disk into 
several distinct planes, forming rings. These 
rings are expected to separate from the disk 
plane and precess around the central stars 
(5). Misaligned disks have been previously 
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observed, but it has not been possible to di- 
rectly link them to disk tearing, either because 
of the nondetection of the perturbing star(s) 
[for example, (6)] or insufficient constraints 
on the orbit [for example, (7-9)]. 

We present observations of GW Orionis, a 
young [1.0 + 0.1 million years old (J0)] triple- 
star system located in the (4 Orionis region of 
the Orion Molecular Cloud, whose central cluster 
is at a distance of 388 + 5 parsec (17). The GW 
Ori system consists of a close [1.2 astronomical 
units (au)] binary with a ~242-day period on 
a nearly circular orbit (stars GW Ori A and 
GW Ori B) (72, 73) and a third star that orbits in 
~11 years at ~8 au separation (GW Ori C) (/4, 15). 

We monitored the orbital motion of the 
system over 11 years using near-infrared inter- 
ferometry (1.4 to 2.4 um thermal continuum 
emission) (fig. S8). Fitting an orbit model to 
these observations results in tight constraints on 
the masses of the three stars [GW Ori A, 2.47 + 
0.33 solar masses; GW Ori B, 143 + 0.18 solar 
masses; and GW Ori C, 1.36 + 0.28 solar masses] 
and the orientation of the orbits (76). The 
orbits of the inner pair (A-B) and the tertiary 
(AB-C) are tilted 13.9 + 1.1° from each other. 

We imaged the system using submillimeter 
and near-infrared interferometry, which trace 
thermal dust emission, and using visible and 
near-infrared adaptive-optics imaging po- 
larimetry, which trace scattered light. These 
observations allow us to constrain the dust 
distribution in the system. Combining these 
techniques enabled us to constrain the three- 
dimensional orientations of the disk compo- 
nents and search for disk warping. The cold 
dust (down to ~10 K dust temperature, traced 
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by 1.3-mm continuum emission) is arranged 
in three rings. The two outer rings (with radii 
of 334 + 13 and 182 + 12 au) (Fig. 1A, Rl and 
R2) are centered on the A-B binary and seen 
at inclinations of 142 + 1° and 143 + 1° froma 
face-on view. This corresponds to retrograde 
rotation (in clockwise direction on the sky), 
with the eastern side tilted toward us by 38° 
and 37° for R1 and R2, respectively. The third, 
innermost ring R3 has a projected radius of 
43.5 + 1.1° au and appears more circular in pro- 
jection than R1 and R2. R3 is offset with respect 
to the center of mass of the system (Fig. 1B). 
Dust emission is apparent between the rings 
as well as inside R3, with a factor of ~10 lower 
flux density than in the neighboring rings. 

Our infrared polarimetric images show asym- 
metric scattered light extending from ~50 to 
~500 au. The scattered light forms four arcs, 
Al to A4 (Fig. 1, C and D), with the eastern 
side appearing brighter than the western 
side. This is consistent with the eastern side 
of the disk facing toward Earth. The dimmer 
regions separating the arcs Al, A2, and A3 
coincide with the dust rings R1, R2, and R3, 
respectively, seen in the submillimeter im- 
age. We interpret this as a shadowing effect 
in which the increased disk scale height at the 
location of dust rings R1, R2, and R3 casts a 
shadow on the flared disk (fig. S3) (16). We 
interpret arcs A3 and A4 as parts of a single 
elliptical structure, whose semimajor axis 
orientation [along position angle (PA) ~30°, 
measured east of north] deviates from the 
orientation of the outer disk (which has PA 
~0°). Two sharp shadows, S1 and S2, extend 
in the radial direction. The eastern shadow 
S1 changes direction at ~100 au separation 
(Fig. 1D), running south at radii <100 au (PA 
~180°% labeled S1;,ne,) and southeast at larger 
radii (PA ~135°, labeled Souter). Two broader 
shadows extend in the north-northwest (S3) 
and southwest directions (S4). A filamentary 
scattered-light structure F,.,, extends from the 
innermost arc (A3) toward the stars (Fig. 1D). 

The outer rings R1 and R2 are closely aligned 
with respect to each other but strongly mis- 
aligned with the orbital plane of the stars, as 
previously suggested on the basis of disk gas 
kinematics (15). Several physical mechanisms 
could have produced this misalignment, in- 
cluding turbulent disk fragmentation (/7), 
perturbation by other stars in a stellar clus- 
ter (8), the capture of disk material during a 
stellar flyby (19), or the infall of material with a 
different angular momentum vector from that 
of the gas that formed the stars (20, 27). The 
innermost ring R3 is strongly misaligned with 
both the outer disk and the orbits because of 
dynamical interaction with the inner multi- 
ple system. 

We built a three-dimensional model, aiming 
to reproduce both the on-sky projected shape 
of the dust rings and the shadows seen in 


lof5 


RESEARCH REPORT 


Fig. 1. Imaging of the 
disk components around 
GW Orionis. (A and B) The 
1.3-mm thermal dust 
continuum emission on 
different spatial scales, 
measured in the spectral 
flux density unit millijan- 
sky (mJy). The main 
components seen in the 
images are labeled, includ- 
ing three rings (R1, R2, 
and R3), an asymmetry in 
he ring R3 (R3asym), 

and dust emission close 
to the stars (Dag and 
Dc). (C and D) Near- 
infrared (C) and visible- 
wavelength (D) scattered 
ight, where the images 
have been multiplied by r 
to emphasize structures 
in the outer disk, where r 
is the distance from 

the stars in the image. 
Four arc structures (Al, 
A2, A3, and A4) and a 
filamentary structure 
(Fscat) are labeled. There 
are four radial shadows 
(SI, S2, S3, and S4); 

S1 changes orientation, 
with a different position 
angle within and outside 
100 au (Slinner aNd Slouter, 
respectively). In (B) to 0.0 
(D), the orbits and 
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positions of the stars at the time of observation are indicated by blue (GW Ori A), orange (GW Ori B), and white (GW Ori C) curves and symbols. The gray ellipses in 
the bottom left of (A) to (D) indicate the angular resolution (beam) achieved by the observation. In (A) to (D), north is up and east is left, as indicated in (B). 


scattered light. On the basis of hydrodynamic 
simulations [for example, (4, 22)] and the 
detection of lower-density dust between R2 
and R3 in our submillimeter image, we mod- 
eled this region as a warped dust filament 
that extends smoothly from ring R2 to a 
break radius, where the warp is truncated. 
Our models show that material at this inner 
truncation orbit appears in the scattered 
light image as the ellipse formed by arcs A3 
and A4 (16). The warped part of the disk 
facing away from Earth is located southeast of 
the stars and is fully illuminated by them, 
appearing as arc A3 (Figs. 1C and 2C). The 
opposite side of the warped disk, located 
northwest of the stars, is facing toward Earth, 
so only the outer surface is visible; this is not 
illuminated by the stars, resulting in the fainter 
scattered light arc A4. Absorption due to dust 
in the warped disk reduces the illumination on 
the northwestern side and causes the broad 
shadows S3 and S4 at PA ~240° and ~20° re- 
spectively, corresponding to the directions with 
the highest radial column density in the warped 
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part of the disk. The surface of the warped disk 
also acts as a screen for shadows cast by the 
geometrically thin misaligned ring R3, result- 
ing in the sharply defined shadow SI. The 
curvature in S1 can then be understood as a 
projection effect, in which Slinner is the shadow 
cast on the warped surface inside of R2 and 
Slouter is the shadow on the nonwarped outer 
disk (Fig. 2A). To reproduce the on-sky pro- 
jected shape of R3, its off-center position with 
respect to the stars and the shape of shadows 
S1 and S2, we adopted a nonzero eccentricity 
(e = 0.3 + 0.1 for ring R3), with the stars lo- 
cated at one of the focal points of the ellipse. 
The eastern side of ring R3 is tilted away from 
us, which is consistent with emission from 
warm (~70 K) molecular gas that we de- 
tected at the inner surface of the ring (fig. S1) 
(16). The three-dimensional orientation of the 
orbits and dust rings in our model is illus- 
trated in Figs. 2 and 3 and parameterized in 
tables S5 and S6. 

Observational signatures of broken proto- 
planetary disks have been predicted in both 
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submillimeter thermal emission and near- 
infrared scattered light (5). That work con- 
sidered a circumbinary disk misaligned by 
60° with the binary orbit, similar to the mis- 
alignment angles observed for GW Orionis 
(51.1 + 1.1° for the A-B orbit and 38.5 + 0.8° 
for the AB-C orbit). There are similarities 
between our observations and the predicted 
synthetic images (5), including a misaligned 
and eccentric ring in submillimeter emission 
and an azimuthal asymmetry in scattered light 
with sharply defined shadows. The model ec- 
centricity of ring R3 matches the prediction 
that the dynamical perturbation by the stars 
should induce oscillations in the orbital inclina- 
tion and eccentricity of broken rings (4, 22, 23). 
We compared the radius of R3 (43 au) with 
analytic estimates of the disk-tearing radius, 
defined as the point in a circumbinary disk 
where the external torque exerted by a mis- 
aligned binary exceeds the internal torque 
because of pressure forces (4). We found that 
the predicted tearing radius is consistent with 
the size of R3 for disk viscosity values a < 0.05, 
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Fig. 3. Interactive three- 
dimensional model of the 
GW Ori system. The model 
can be zoomed and rotated in 
Adobe Reader to display the 
disk geometry. The x and y 
axes correspond to the 
directions of Dec (north) and 
RA (east), respectively, and 
the z axis points toward Earth. 
Zooming in shows the triple 
star orbits (GW Ori A, blue; 
GW Ori B, orange; and GW Ori 
C, black). 


Fig. 2. Scattered light model used to determine 
the eccentricity and three-dimensional orienta- 
tion of ring R3 and the geometry of the disk 
warp. (A) Diagram of the three-dimensional orienta- 
tion of the disk components in the model (16). 

(B) Orientation of the orbits in the hierarchical triple- 
star system, with the stellar positions at the time of 
our imaging measurements indicated (same as in 
Fig. 1). The coordinate system is centered on the center 
of mass of the system. The white grid indicates the 
observed plane of the sky (right ascension RA and 
declination Dec), and the z axis points toward the 
observer, represented by the eye symbol in (A). 

(C) Synthetic image computed (16) from the scattered 
light model, convolved to match the resolution of the 
observation. The position of the scattered light 
features from Fig. 1, C and D, are indicated. 


suggesting that this disk region is susceptible 
to disk tearing (16). 

We used the observational constraints on 
the orbital parameters of the GW Orionis sys- 
tem as input for simulations using smoothed 
particle hydrodynamics (SPH) and radiative 
transfer. We parameterized the initial disk 
with the observed total dust mass (24) and 
adopted the measured stellar orbits and outer 
disk orientation (/6). After a few thousand 
years, the gravitational torque from the mis- 
aligned triple-star system breaks the disk apart 
into several distinct planes. An eccentric ring 
forms with a radius of ~40 au, which precesses 
around the inner multiple system with a pre- 
cession period of ~8000 years. A snapshot of 
this simulation is shown in Fig. 4. The size of 
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log (column density / g cm’) 


the simulated ring, its eccentricity, asym- 
metric azimuthal density profile (with high- 
est density near the farthest part in the ring), 
and misalignment with the outer disk match 
the characteristics of ring R3 observed at sub- 
millimeter wavelengths. This suggests that 
ring R3 in the GW Orionis system formed by 
disk tearing. The SPH simulation also forms 
a low-density warped disk (Fig. 4C), whose 
properties and spatial orientation broadly 
resemble the disk warp in our scattered light 
model (Fig. 2). 

The origin of the gap between the two outer- 
most dust rings seen in millimeter emission 
(between R1 and R2) remains unclear. The gap 
might be primarily due to depletion in large 
dust grains because millimeter-sized dust grains 
might accumulate at the strong density gradi- 
ent near the outer edge of the disk warp (25). 
Alternatively, the dust gap might coincide with 
a lower gas density, which could be due to un- 
detected planets within the gap, or disk-tearing 
effects occurring further out in the disk that 
are not reproduced by our SPH simulation 
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(Fig. 4). Some hydrodynamic simulations of 
misaligned multiple stars found that disk tear- 
ing can result in a set of multiple nested rings 
[for example, (4)] or dust pile-up in warped 
disk regions resulting from differences in pre- 
cession between the gas and dust components 
(26), although we estimate that the gas drag 
forces excerted on the dust particles traced by 
our millimeter observations are likely too low 
for the latter mechanism to operate (6). 

Our results show that disk tearing occurs in 
young multiple-star systems and that it is a 
viable mechanism to produce warped disks 
and misaligned disk rings that can precess 
around the inner binary. By transporting ma- 
terial out of the disk plane, the disk-tearing 
effect could provide a mechanism for forming 
planets on oblique or retrograde orbits (orbit- 
ing in the opposite direction to the orbital axis 
and/or rotation axes of the stars). About 40% 
of short-period exoplanets (<40 days orbital 
period) are on oblique or retrograde orbits (27). 
The most commonly invoked explanations are 
planet-planet scattering and tidal interactions 
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Fig. 4. SPH simulation. 
The computation is based 
on the measured GW 
Orionis orbits and system 
parameters, evolved for 
9500 years. (A) Gas 
density projected on the 
plane of the sky, with 
north up and east left. The 
third axis (positive z) is 
facing out of the page. 

(B and C) Integrated 

gas density projected 

in the z-Dec plane and 
RA-z plane. (D) Density 
cut along the RA-z plane. 


column density / g cm? 


log (density / g cm*) 


from wider-orbiting planets (28). Few obser- 
vations are available for long-period planets 
and circumbinary planets, with all cases indi- 
cating close alignment between the stellar spin 
and planet orbit plane [the most inclined cir- 
cumbinary planet known is Kepler-413b, with 
obliquity of 2.5° (29)] (30). We found that disk 
tearing can induce large misalignments in a 
disk, which emerge sufficiently quickly to in- 
fluence the planet-formation process. The broken 
ring R3 contains ~30 Earth masses in dust (table 
S6), which could suffice for planet formation 
to occur. Long-period planets on highly oblique 
orbits could form from rings around misaligned 
multiple systems. 
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Proton-electron mass ratio from laser spectroscopy 
of HD* at the part-per-trillion level 


Sayan Patra’, M. Germann’, J.-Ph. Karr”, M. Haidar’, L. Hilico”®, V. I. Korobov‘, F. M. J. Cozijn’, 


K. S. E. Eikema?®, W. Ubachs?*, J. C. J. Koelemeij'+ 


Recent mass measurements of light atomic nuclei in Penning traps have indicated possible inconsistencies 
in closely related physical constants such as the proton-electron and deuteron-proton mass ratios. These 
quantities also influence the predicted vibrational spectrum of the deuterated molecular hydrogen ion 
(HD*) in its electronic ground state. We used Doppler-free two-photon laser spectroscopy to measure the 
frequency of the v = 0-9 overtone transition (v, vibrational quantum number) of this spectrum with an 
uncertainty of 2.9 parts per trillion. By leveraging high-precision ab initio calculations, we converted 

our measurement to tight constraints on the proton-electron and deuteron-proton mass ratios, consistent 
with the most recent Penning trap determinations of these quantities. This results in a precision of 

21 parts per trillion for the value of the proton-electron mass ratio. 


recision measurements on simple atomic 

systems and their constituents play an 

essential role in the determination of 

physical constants. Examples range from 

the proton-electron mass ratio (7,/™Me), 
the value of which depends strongly on mea- 
surements performed on single protons and 
hydrogen-like ions stored in Penning traps, to 
the Rydberg constant (R..) and proton electric 
charge radius (7p), which are derived from 
spectroscopic measurements of energy inter- 
vals in atomic hydrogen-like systems (J, 2). It is 
desirable to perform such determinations of 
physical constants redundantly by using dif- 
ferent systems and methods, as this provides a 
crucial cross-check for possible experimental 
inconsistencies or physical effects beyond our 
current understanding of nature. This need 
is illustrated by the proton radius puzzle, a 
5.60 discrepancy between the value of 7, ob- 
tained from muonic hydrogen spectroscopy 
and the 2014 Committee on Data for Science 
and Technology (CODATA-2014) reference value 
(/, 3). Progress toward solution of the puzzle was 
made after most of the recent 7, determi- 
nations from electron-proton scattering and 
atomic hydrogen spectroscopy were found to 
be consistent with the muonic hydrogen value 
(4-7). A similar need for alternative measure- 
ments is indicated for m,/m,—an important 
dimensionless quantity that sets the scale of 
rotations and vibrations in molecules—because 
recent Penning trap measurements of the 
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relative atomic masses of light atomic nuclei 
[including those of the proton (m,), deuteron 
(mg), and helion (m,)] differed from earlier 
results by several standard deviations (8-15). 
For example, Heifse et al. (11) determined m, 
with a precision of 32 parts per trillion (ppt), 
three times as high as the then-accepted 
CODATA-2014 value, but also found it to be 
smaller by 30 (//, 12). The value from (11) has 
been incorporated in the 2017 and 2018 
CODATA adjustments, but uncertainty mar- 
gins were increased by a factor of 1.7 to accom- 
modate the difference (2). This uncertainty 
range currently limits the precision of m,/7¢. 
(obtained by dividing m, by the more precise 
CODATA-2018 value of 7.) to 60 ppt, which 
in turn diminishes the predictive power of 
ab initio calculations of rotational-vibrational 
(rovibrational) spectra of molecular hydro- 
gen ions (H,* and HD‘) and antiprotonic 
helium, which have achieved a precision of 
7 to 8 ppt (16). 

The high theoretical precision, in princi- 
ple, enables an improved determination of 
m,/m, from spectroscopy of molecular hy- 
drogen ions, which could shed light on this 
situation (17). However, such an improvement 


requires measurements with uncertainties on 
the parts-per-trillion level, which is two orders 
of magnitude beyond that of state-of-the-art 
laser (18, 19) and terahertz (20) spectroscopy 
of HD* and antiprotonic helium. Here, we 
present a frequency measurement of the (v,Z): 
(0,3)—(9,3) vibrational transition (v, vibra- 
tional quantum number; L, rotational angu- 
lar momentum quantum number) in the 
electronic ground state of HD* with 2.9-ppt 
uncertainty, which is notably more precise 
than the theoretical uncertainty. This find- 
ing allows us to extract a new value of m,/m. 
and provide a cross-link to other physical 
constants, which enables additional consist- 
ency checks of their values. 

We previously identified the (v,L): (0,3)— 
(4,2)(9,3) two-photon transition in HD* 
(Fig. 1A) as a promising candidate for high- 
resolution Doppler-free laser spectroscopy (27), 
owing to the near-degeneracy of the 1442- and 
1445-nm photons, as well as the possibility of 
storing HD* ions in a linear Paul trap while 
cooling them to 10 mK through Coulomb in- 
teraction with cotrapped beryllium ions, which 
are themselves cooled by 313-nm laser radia- 
tion. We showed that for counterpropagat- 
ing 1442- and 1445-nm laser beams directed 
along the trap’s symmetry axis, Doppler-free 
vibrational excitation of HD* deep in the op- 
tical Lamb-Dicke regime may be achieved. 
Thus, with a natural linewidth of 13 Hz, 
quality factors of >10” become within reach. 
We used phase-stabilized, continuous-wave ex- 
ternal cavity diode lasers at 1442 and 1445 nm 
with linewidths of 1 to 2 kHz to vibrationally 
excite cold, trapped HD* ions (22). Optical fre- 
quencies were measured with an uncertainty 
below 1 ppt using an optical frequency comb 
laser, whereas two-photon excitation was de- 
tected through enhanced loss of HD* from the 
trap, owing to state-selective dissociation of 
molecules in the v = 9 state by 532-nm laser 
radiation (22, 23). 

Rovibrational energy levels of HD* exhibit 
hyperfine structure caused by magnetic inter- 
actions between the spins of the proton (I,), 


Table 1. Leading systematic shifts and uncertainties. Shifts and their standard uncertainties 
(in parentheses) are given in kilohertz. Their justification can be found in (22), as well as the complete 


error budget (table S2). 


Description 


F = 0 transition F = 1 transition 


0.03(1) “O11(3) 


0.38(43) 


0.42(43) 


Total systematic shifts + fitted optical frequencies 
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0.38(59) 0.42(66) 
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Fig. 1. Partial level diagram and multiphoton 
transitions. (A) Two-photon transitions are 
driven between rovibrational states with (v,L) = 
(0,3) and (9,3) in the 1so electronic ground 
state of HD*. State-selective dissociation of the 
v = 9 population is induced through excitation 
to the antibonding 2po electronic state by a 
532-nm photon. (B) Spin-averaged transition 
frequency (vsq) and hyperfine structure (not to 
scale) of the levels involved in the two-photon 
transition, as well as graphical definitions 

of the frequencies and detunings of the electro- 
magnetic fields driving transitions between 
them. (C) Graphical definition of the hyperfine 
intervals in the two-photon transition. 


Fig. 2. Spectra of the two-photon tran- 
sition at 415 THz. (A) Spectra of the F = 0 
transition at various levels of the rf power 
(P,¢). Lorentzian line fits are shown along 
with 68% confidence level bands. Each data 
point represents the mean of a set of 
(typically) nine individual measurements, 
with error bars indicating SEM. (B) Spectral 
data and Lorentzian line fits for the 

F = 1 transitions at two different values of 
the 532-nm laser intensity (/). (C) Fitted 
line centers of the F = O transitions 
[corrected for systematic shifts (22)] shown 
in (A) are additionally used to check for a 
possible quasi-resonant ac Zeeman shift 

by fitting a linear model and extrapolating to 
Pi, = O mW. The fit (dashed blue line) implies 
no significant shift. The zero-field F = 0 
frequency and uncertainty are indicated by 
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the red horizontal line and pink bands, respectively. (D) F = 1 line-center frequencies from the fits shown in (B), after correction for systematic shifts (22). The purple 
line and bands indicate the weighted mean and uncertainty, respectively. 


deuteron (Ig), and electron (s,), as well as the 
molecule’s rotational angular momentum (L) 
(24). The spins are coupled to form resultant 
angular momenta F = s, + I, andS = F+Iy 
and are finally coupled with L to form the 
total angular momentum J = S + L. We ob- 
served transitions (v,L; F,S,J): (0,3; 1,2,5) (9,3; 
12,5) (here referred to as the “F = 1 transi- 
tion”) and (v,L; F,S,J): (0,3; 0,1,4)—(9,3; 0,1,4) 
(the “F = 0 transition”); see Fig. 1B. 

To record a spectrum, we kept the 1442-nm 
laser frequency (vy) (with F = 0,1; see Fig. 
1B) at a fixed detuning (6,) from resonance 
to avoid excessive population of the inter- 
mediate v = 4 state (21, 22). Meanwhile, we 
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stepped the 1445-nm laser frequency (v’,) in 
intervals of 2 kHz over the range of interest 
(Fig. 1B). At each step, we let all lasers inter- 
act with the HD* ions for 30 s, after which we 
determined the cumulative loss of HD* and 
added the resulting data point to the spec- 
trum (22). A typical spectrum covers a span 
of 40 to 60 kHz, with an average of nine 
points per frequency and with the 180 to 270 
data points acquired in random order over 
~10 measurement days. The signal-to-noise 
ratio of the F = 0 spectrum turned out to be 
lower than its F = 1 counterpart, which we 
attribute to a smaller available population 
in the initial state and slower repopulation 
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by blackbody radiation (27). To increase the 
F = 0 signal, we applied two radio frequency 
(rf) magnetic fields to drive the population 
from the (F;S,J) = (1,2,5) and (1,2,4) states of 
the v = 0, L = 3 hyperfine manifold to the 
(F,S,J) = (0,1,4) states (see Fig. 1B and fig. 
S1) (22). Recorded spectra of the F = 0 and 
F = 1 transitions are shown in Fig. 2. 

The interpretation of the recorded spectra 
requires analysis of several systematic effects 
that affect line shape and position (22). We 
exploit the good theoretical accessibility of 
the HD* molecule (25), which allows a priori 
estimation of these effects. Zeeman and Stark 
effects are calculated to shift the F = 0 and 
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Fig. 3. Implications for the values of A 


physical constants. (A) Comparison 

between vsa,exp and theoretical frequencies 

VsA,theo (kK) obtained for the indicated 

combinations of physical constants, k. 

Arrows represent the cumulative frequency 

shift introduced by consecutively replacing 

the CODATA-2014 values of mp/me (blue), 

rp (yellow), ra (purple), R.. (green), and 

ma/Me (gray) with their counterparts 

of the set k. Error bars indicate 

lo uncertainty. (B) Values and uncertainties 

of m,/mMe from this work (blue data 

points) compared with measured m, values 

from other sources, which were converted 

to values of m,/m, through division by m. 
(CODATA-2018). The lowermost blue data B 
point represents the value derived in (C). 

Dashed lines and shaded areas represent 

CODATA values and their +lo ranges, 

with brackets indicating which of the 
measurements shown were included in 

the respective CODATA adjustments. 

Error bars indicate lo uncertainty. 
(C) Simultaneous constraint on mp/Me 
and mg/m, from HD* and recent 
independent measurements of these 
quantities, leading to new values of m,/m, 
and mg/Mp (blue dotted lines) and the 
corresponding lo-constrained region 
(white ellipse). 


F = 1 lines by as much as 0.5 kHz through 
level shifting and line-shape deformation (22). 
The expected two-photon power broadening 
and interaction-time broadening from the 
9 x 10? s' rate of dissociation of molecules 
in the v = 9 state (27) satisfactorily explain 
the observed linewidths of 8(3) kHz (number 
in parentheses denotes uncertainty). In addi- 
tion, we experimentally investigated a num- 
ber of systematic effects, yielding results 
consistent with the theory-based estimates 
(22). The sizes and uncertainties of leading 
systematic effects are listed in Table 1. 

As shown in Fig. 2, Lorentzian line shapes 
are fitted to the spectra to determine their 
respective line centers with 0.6- to 0.7-kKHz 
uncertainty. These are subsequently corrected 
for systematic frequency shifts and combined 
to arrive at the F = 0 and F = 1 transition fre- 
quencies: VyFo,.xp ANd VyFiexp (22) (see Fig. 2, 
C and D, and Table 2). These frequencies are 
related to the spin-averaged (i.e., pure rovibra- 
tional) frequency (vsq) through the relations 
Vsa = Vitro ~ foc aNd vga = Vii — fic (f, hyper- 
fine shift) (Fig. 1C). Because only vs, depends 
directly on the values of the physical con- 
stants of interest, we need to determine and 
correct for the hyperfine shifts f,. = -63 MHz 
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He + m,/m, (Heifee (12)) 
+ mglm, (Fink (14)) 
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12 
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i 1 
67.33 67.34 
(m,/m, - 1,836.15200) x 10° 


Table 2. Experimental and theoretical transition frequencies and hyperfine intervals. 
Uncertainties are given in parentheses and justified in detail in (22). The uncertainties of hyperfine 
intervals include an expansion factor of ~2. During data acquisition and in Fig. 2, theoretical 
frequency values (VuFotheo ANd Vuritheo) based on CODATA-2014 constants were used as offset 
values. All other theoretical frequency values were obtained from CODATA-2018 physical constants. 


Value (kHz) 
415,265,040,466.8 


p) 
114,999.7(.9) 


415,264,925,500.5(1.2) 


*Offset values based on CODATA-2014 constants (included for completeness). 


and fo. = 115 MHz to derive vga. We take the 
hyperfine intervals foctheo and f(ctheo from 
theory (22, 24, 26) and compute Vs exp as 
the mean of VHFO,exp ~ foc,theo and VHFl,exp — 
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Sic,theo (22). In this process, we expand the 
uncertainties of the theoretical hyperfine 
intervals by about a factor of 2 (22) so that 
the theoretical hyperfine interval ({o,tneo) 
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becomes consistent with its measured counter- 
part (Aoexp = VuFoexp — VHFiexp) (Table 2). We 
thus find Vgaexp = 415,264,925,500.5(0.4) exp 
.Dineo(-2)totat kHz. 

Our experimental frequency vga exp exceeds 
the theoretical frequency Vsa theo (CODATA- 
2014) = 415,264,925,467.1(10.2) kHz by 33.4 kHz, 
or 3.36, when we use CODATA-2014 physical 
constants to compute vga theo (22, 27). The un- 
certainties of these constants dominate the 
10.2-kHz uncertainty rather than the 3.1-kHz 
precision of the theoretical model—e.g., m/e. 
contributes 9.0 kHz (fig. S3) (22). Using known 
sensitivity coefficients (17, 22), we can also 
compute other theoretical frequency values, 
Vsa.theo (K), for other combinations (labeled ) 
of values of physical constants. For example, 
a more precise value is obtained by use of 
CODATA-2018 constants: vga theo (CODATA- 
2018) = 415,264,925,496.2(7.4) kHz. This state- 
of-the-art value is shifted by 29.1 kHz with 
respect to the CODATA-2014 value (Fig. 3A) 
and essentially closes the 33.4-kKHz gap with 
our experimental value (Vs exp). Figure 3A 
furthermore shows that most of the 29.1-kHz 
shift stems from the smaller CODATA-2018 
value of 7,/mm,.. A smaller part, 5.1 kHz, is due 
to the CODATA-2018 updated values of 7, Ta, 
and R., which are essentially equal to the 
muonic hydrogen values (3, 28). The 5.1-kHz 
shift, which is four times as large as our ex- 
perimental uncertainty and comparable to the 
current theoretical precision, therefore reveals 
the impact of the proton radius puzzle on mo- 
lecular vibrations. We obtain even better pre- 
cision (5.5 kHz) and agreement after replacing 
the CODATA-2018 value of m,/m,. with that 
from (J/, 12), this time leading to a 31.2-kHz 
shift (Fig. 3A). 

We may also invert the procedure and de- 
rive a new value of m,/m, from the difference 
Vsaexp — Vsa,theo (K); see Fig. 3B. Using vsa theo 
(CODATA-2018), we obtain m,/m. (HD*) = 
1,836.152673349(71), which is slightly more 
precise than, and in excellent agreement with, 
the value of m,/m, from (12). Because vga theo 
is also sensitive to the deuteron-proton mass 
ratio (22), one may alternatively extract a two- 
dimensional constraint in the (™m,/7., Mq/Mp) 
plane (Fig. 3C). Our result is in good agree- 
ment with both m,/m, from (2) and the recent 
value of mg/m, 4), assuming CODATA-2018 
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values of 7, 7a, and R.. This justifies a deter- 
mination of m,/me. from the combination of 
all three results shown in Fig. 3C, leading to a 
value of 1,836.152673406(38) (lowermost point 
in Fig. 3B) which, at 21-ppt precision, repre- 
sents the most precise determination of this 
quantity to date. The data shown in Fig. 3C can 
furthermore be combined with the CODATA- 
2018 value of m, and the value of 7, from (5) 
to obtain the atomic mass difference 7, + Mma - 
My = 0.00589743254(12) u (where u is the uni- 
fied atomic mass unit). The same quantity has 
previously been determined from the measured 
mass ratio "He*/HD* (13), leading to mp + ma - 
My = 0.00589743219(7) u. The two results differ 
by 0.35(14) nu, or 2.50. We thereby confirm the 
“He puzzle,” a term used to describe similar 
deviations of 0.48(10) nu (4.80) and 0.33(13) nu 
(2.40) reported earlier (13, 14). 

Our work establishes precision spectros- 
copy of HD*, combined with ab initio quantum 
molecular calculations, as a state-of-the-art 
method for determining fundamental mass 
ratios. It furthermore provides a link between 
mass ratios and other physical constants, such as 
R., and sheds light on the large deviations seen 
between recent determinations of their values. 
We anticipate that our results will have a notable 
impact on the consistency and precision of fu- 
ture reference values of physical constants and 
will enhance the predictive power of ab initio 
calculations of physical quantities. 

Note added in proof: In a recent and inde- 
pendent study by Alighanbari et al. (29), a value 
for the proton-electron mass ratio comparable 
to ours was obtained from rotational spectros- 
copy of HD*. 
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CLIMATE RESPONSES 


Predicting temperature mortality and selection in 
natural Drosophila populations 


Enrico L. Rezende’*, Francisco Bozinovic?, Andras Szilagyi**, Mauro Santos*+ 


Average and extreme temperatures will increase in the near future, but how such shifts will affect mortality in 
natural populations is still unclear. We used a dynamic model to predict mortality under variable temperatures 
on the basis of heat tolerance laboratory measurements. Theoretical lethal temperatures for 11 Drosophila 
species under different warming conditions were virtually indistinguishable from empirical results. For 
Drosophila in the field, daily mortality predicted from ambient temperature records accumulate over weeks or 
months, consistent with observed seasonal fluctuations and population collapse in nature. Our model quantifies 
temperature-induced mortality in nature, which is crucial to study the effects of global warming on natural 
populations, and analyses highlight that critical temperatures are unreliable predictors of mortality. 


lobal warming is a major threat to bio- 
diversity, with temperature averages 
and extremes forecasted to change sub- 
stantially in the next 50 years (7), and 
predicting which lineages, commu- 
nities, and geographical regions are more 
vulnerable constitutes a major challenge (2, 3). 
Numerous research groups have used critical 
limits, namely the lower or upper temper- 
atures at which performance drops to zero 
(4, 5), to unravel broad-scale macroecolog- 
ical patterns such as a higher vulnerability to 
rising temperatures in terrestrial organisms 
from the tropics (6, 7). However, the reliability 
of these proxies is uncertain because different 
experimental protocols elicit different estimates 
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Fig. 1. Predicting mortality in 
thermally variable environ- 
ments. (A) Time to death 
under a constant thermal 
regime varies predictably in a 
log-linear fashion with body 
temperature, giving rise to 
typical thermal death time 
curves whose slope quantifies 
the thermal sensitivity z. Data 
points represent simulated 
individuals that collapsed dur- 
ing a static assay. (B) In light 
of this relationship, results 
from different temperature 
assays can be expressed using 


logio Time 


Temperature 


(8, 9) and values are often unusually high when 
compared with temperatures encountered in 
nature [(J0), but see (17)]. 

Here, we show that these inconsistencies 
can be explained with a common theoretical 
framework, whose main premise is that the 
cumulative impact of any thermal stress varies 
with temperature and time. Similar methods 
are commonly used in the food processing and 
pest control literature (12, 13), and yet their 
application in thermal ecology remains conten- 
tious [but see (4, 15)]. We previously combined 
survival probability functions with measure- 
ments of elapsed time for thermal death to 
obtain a continuous “tolerance landscape” at 
different constant temperatures (76) and now 
expand this framework to predict survival in a 
variable environment. The logarithm of sur- 
vival times varies linearly with temperature 
(Fig. 1A), and results in parallel thermal death 
time curves for different relative survivals (S) that 
can be described with a simple relation between 
exposure times (t) and temperature (7): 


TT? (T9-T)) 


=10~ 


(for any given S) (1) 
q 


Constant temperatures 


Survival (S) 


Time (t) 


where z > 0 corresponds to the thermal sen- 
sitivity describing the AT required to change 
t one order of magnitude (Fig. 1A and sup- 
plementary materials). Thus, if g = 2°C, an 
organism that tolerates 40°C for 1 min could 
withstand 38°C for 10 min and 36°C for 
100 min. For standardization purposes, hereafter 
we refer to the temperature at which mean t = 
1 min as T,y2x, Which corresponds to the tem- 
perature at which the linear regression touches 
the abscissa with log,9-transformed t (Fig. 1A). 
Equation 1 implies that any survival proba- 
bility function S7(t) shifts horizontally by 
107 asa consequence of 1°C increase in tem- 
perature, and therefore, we can transform 
any survival probability function from T; to 
T, with the relationship (for notational sim- 
plicity, we denote the temperature dependence 
by lower index): 


(T9-T)) 


Sr, (t) => S7,10- TY (2) 
(Fig. 1B). In a variable thermal environment, 
changes in the survival probability function 
are coupled to changes in temperature and 
survival times as 


At (3) 


(Fig. 1C). The survival rate at any given time 
v can be calculated by summing up the in- 
finitesimal small changes in the time interval 
[0, t’]. This can be accomplished analytically 
or numerically (supplementary materials). 
We validated the numerical method, which 
has the advantage of not requiring the analyt- 
ical form of S(t) to be defined, by success- 
fully predicting survival responses of several 
Drosophila species subjected to highly contrast- 
ing warming regimes. The dataset (77) comprises 
11 species whose survival was measured at 
constant temperatures between 32° and 43.5°C 
(n = 1289 individuals) and that were also 


Rising temperatures 


Survival (S) 


Time (t) 


a single survival probability curve that shifts in time by a factor determined by z, increasing or decreasing mortality rates. (©) Using this framework predicted how 
temporal variation in temperature affects the survival probability curve and, therefore, mortality rates under variable temperature conditions. 
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Fig. 2. Predicted versus 
observed heat tolerance. 
Predictions were validated 
against heat tolerance 
estimates obtained empir- 
ically in 11 Drosophila spe- 
cies. (A) Fitted versus 
reported death times 
under constant thermal 
regimes, which supports 
our contention that 
empirical survival curves 
obtained at different 
temperatures can be 
described by a single sur- 
vival probability function 


12h 


3h 


Time measured 
10 min 


1min 


10 min 


1 min 


Time fitted 


Constant temperatures 


0.10 °C min“ 
4 


Time measured 


0.25 °C min 


th 3h 12h 1h 3h 


Time predicted 


0.05 °C min“ 


Rising temperatures 


Temperature measured (°C) 


12h 36 38 40 42 44 
Temperature predicted (°C) 


that shifts in time (Fig. 1). (B) Predicted versus reported death times and (C) lethal temperatures at different warming rates. The dotted line represents the equality 
xX = y and the continuous line an ordinary least-squares regression; points are shown in a color gradient from low (blue) to high temperatures (red). 


subjected to warming temperatures starting at 
19°C and increasing 0.05°, 0.10°, and 0.25°C/ 
min (7 = 504 individuals) until individuals died. 
Assays lasted between ~1 min and 17.5 hours 
at constant temperatures and between 1 and 
8 hours in ramping experiments, and dehy- 
dration or starvation effects (9) were pre- 
vented by giving flies access to food and 
water throughout the trials (17). We initially 
determined how well a single survival proba- 
bility curve fitted the data obtained at differ- 
ent temperatures (figs. S1 to S3): We pooled 
all values into a single S(t) at mean T and 
remapped this curve back to each tempera- 
ture to obtain theoretical estimates of survival 
(Eq. 2). Across species, Tinax ranged between 
38.9° and 44.8°C, and z between 1.81° and 3.53°C. 
Fitted estimates closely resembled empirical 
values (Fig. 2A), with R? between fitted and 
empirical data ranging between 0.946 and 
0.984 for each species (R” = 0.965 + 0.012, 
mean + SD). Therefore, a single S(t) for each 
species can successfully describe survival rates 
across thermal regimes. 

We then used each species’ survival function 
to predict how it would respond to rising tem- 
peratures (fig. S4). Predicted times and tem- 
peratures for collapse were fairly accurate when 
contrasted against empirical measurements 
(Fig. 2, B and C). These results highlight that 
different tolerance estimates correspond toa 
single trait expressed in different environments, 
instead of multiple traits presumably under in- 
dependent genetic control (78, 79), and bridge 
the gap between measurement reliability and 
ecological realism (19, 20). With this frame- 
work, it should be possible not only to compare 
species’ thermal tolerances under standardized 
conditions but also to predict the intensity of 
selection under natural temperature regimes. 
Because this model translates probability dis- 
tributions from static to dynamic conditions 
using basic calculus, and thermal death time 
curves are pervasive in terrestrial and aquatic 
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ectotherms (16), the adequacy of this method 
lies primarily on the appropriate estimation of 
body temperatures in the field. Although here 
we assume that body temperature equals 
ambient temperature, which is reasonable for 
Drosophila (21), for other ectotherms whose 
thermal inertia is not negligible body tem- 
peratures can be estimated from biophysical 
modeling and ambient temperature records. 
However, empirical validation in other taxa 
remains necessary. 

We next explored how estimates of heat 
tolerance in Drosophila subobscura translate 
to conditions in the field. This species has 
been extensively studied and provides one of 
the most compelling cases for adaptive genetic 
shifts in response to climate change (22). To 
predict mortality, we combined the survival 
probability curves S(t) of a mid-latitude (33° 
27'S) population of D. subobscura from Santi- 
ago (23) acclimated to 13° and 18°C (n = 237 
and 227 individuals, respectively) with the 
daily thermal profile of this location. Temper- 
atures were obtained with an hourly resolu- 
tion for 1984 to 1991 and 2014 to 2018 (table 
S1 and fig. S5). According to the model, the 
seemingly similar tolerance estimates for cold- 
acclimated (Tinax = 41.1°C and g = 4.9°C) and 
warm-acclimated flies (Tmax = 40.3°C and z = 
3.7°C) should result in contrasting mortality 
rates in the field (Fig. 3), primarily because 
of z, given that mortality is expected at low 
temperatures compared with T\yax. For cold- 
acclimated flies, daily mortality >10% is pre- 
dicted during mid-austral spring, with current 
thermal maxima in November in the order of 
26.8° + 13°C, whereas for warm-acclimated 
flies, elevated mortality is only expected 
during summer, with maxima in January of 
30.2° + 0.9°C. Thus, acclimation has a marked 
impact on heat tolerance, increasing the win- 
dow for reproduction by nearly a month from 
mid-spring to early summer. However, compar- 
isons between 1984 to 1991 and 2014: to 2018 


4 September 2020 


suggest that this window is jeopardized by 
global warming (Fig. 3). 

To validate predicted responses in the field, 
we compared predicted cumulative mortality 
curves (calculated as the product of the daily 
survival, which assumes that individuals that 
survived the thermal stress can recover during 
the night) against reported fluctuations in 
the population size of D. subobscura under 
natural conditions from a long-term longitu- 
dinal study in a field population from Santiago 
(24). The daily abundance of D. subobscura 
was monitored monthly between 1984 and 
1991 and exhibited a marked seasonal perio- 
dicity, with the number of collected individu- 
als ranging from 1.0 + 1.1 by the end of the 
austral summer (March) to 439.9 + 218.6 flies 
in the abundance peak in mid-spring (October 
and/or November). Every year, the number 
of caught flies increased roughly exponen- 
tially as temperatures rose from winter to 
spring and collapsed by mid-spring and early 
summer as predicted by the cumulative mor- 
tality curves (Fig. 3). The temporal window 
in which population collapse occurs involves 
maximum temperatures that are up to 10°C 
lower than the published heat tolerance es- 
timates for D. subobscura, which range be- 
tween 35.1° and 38.6°C under gradual heating 
(Fig. 3). 

Our analysis suggests that strong thermal 
selection occurs over time at temperatures 
that are low in comparison with estimated 
upper critical thermal limits. We cannot dis- 
card competitive interactions with other 
drosophilids coexisting with D. subobscura (24) 
that might affect its distribution and abun- 
dance in summer (25) or other factors (such 
as ontogenetic variation in thermal sensitivity) 
affecting the physiological status of the flies. 
However, our results match patterns observed 
for chromosomal inversion polymorphisms, in 
which strong selective shifts were detected at 
temperatures that seldom surpass 30°C (26). 
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Winter 


Temperature (°C) 


® Cold-acclimated 13 °C 
® Warm-acclimated 18 °C 


Daily mortality (%) 
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abundance 
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Nov 
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--+- 1984-1991 ar 
2014-2018 , 


Dec Jan Feb 


Fig. 3. Mortality rates and selection under natural conditions. (A) Averaged hourly temperatures for 
2014 to 2018 in Santiago, Chile. (B) Predicted mortality on a daily basis based on these temperatures for 
acclimated D. subobscura. (C) Cumulative mortality for both 1984 to 1991 and 2014 to 2018. Arrows show the 
predicted impact of global warming during this 30-year period. The gray lines show the daily abundance 
of D. subobscura, monitored monthly in the field between 1984 and 1991, expressed relative to each year's 
maximum (the shaded area is a convex hull bounding the observed abundances). Mortality cannot be 


predicted from the critical maximum for this species. 


Analyses also highlight that measurements 
of critical temperatures may seriously over- 
estimate warming tolerance (JO, 27) because 
a low thermal mortality on a daily basis will 
accumulate over time (Fig. 3). For instance, a 
nearly imperceptible daily mortality of 1% 
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results in a cumulative mortality of ~20% after 
3 weeks (1 - 0.99”). And these predictions 
might be conservative, as they ignore the ther- 
mal impact on fecundity (28). 

Temperature tolerance has been studied in 
comparative physiology for many decades (29), 
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and here we propose a paradigm shift from a 
static critical limit to a dynamic, more realistic 
and theoretically sound framework (72, 13). Our 
model provides an intuitive tool to assess how 
laboratory measurements translate into differ- 
ences in survival in the field, which may be 
expanded in future analyses to include other 
factors, such as thermal heterogeneity in the 
environment, behavioral thermoregulation, 
ontogenetic variation in heat tolerance, or 
thermal inertia. This framework is not restricted 
to Drosophila. In principle, it is readily appli- 
cable to other small ectotherms whose survival 
can be adequately measured in the laboratory 
and thermal microhabitats estimated accu- 
rately in the field (30). 
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BIOGEOCHEMISTRY 


Plants sustain the terrestrial silicon cycle during 


ecosystem retrogression 


F. de Tombeur"*, B. L. Turner’, E. Laliberté*“, H. Lambers*, G. Mahy’, M.-P. Faucon®, 


G. Zemunik’, J.-T. Cornelis? 


The biogeochemical silicon cycle influences global primary productivity and carbon cycling, yet changes 
in silicon sources and cycling during long-term development of terrestrial ecosystems remain poorly 
understood. Here, we show that terrestrial silicon cycling shifts from pedological to biological control 
during long-term ecosystem development along 2-million-year soil chronosequences in Western 
Australia. Silicon availability is determined by pedogenic silicon in young soils and recycling of plant- 
derived silicon in old soils as pedogenic pools become depleted. Unlike concentrations of major 
nutrients, which decline markedly in strongly weathered soils, foliar silicon concentrations increase 
continuously as soils age. Our findings show that the retention of silicon by plants during ecosystem 
retrogression sustains its terrestrial cycling, suggesting important plant benefits associated with this 


element in nutrient-poor environments. 


ilicon (Si) is widely recognized as an im- 
portant regulator of the global carbon 

cycle via its effect on diatom productivity 

in oceans (2) and the weathering of sili- 

cate minerals on continents (2). Si is also 

a beneficial plant nutrient (3, 4), improving 
resistance to herbivory and pathogens (5) and 
mitigating the negative effects of several abiotic 
stresses (6), including nutrient limitation (7, 8). 
As a result, Si improves plant performance and 
can contribute to the functioning of terrestrial 
ecosystems (5, 9, 10). Detailed information on 
long-term controls on Si cycling therefore un- 
derpins our understanding of Si-related func- 
tions in plants, fluxes to aquatic ecosystems, 
and ultimately the fixation of atmospheric 
carbon in terrestrial and oceanic ecosystems. 
The release of Si into the soil solution reg- 
ulates its availability to plants and its transfer 
from land to oceans. While the concentration 
of dissolved Si in the soil solution has long 
been understood to be driven primarily by 
geochemical processes (i.e., mineral dissolu- 
tion), it is now recognized that Si mobility is 
influenced strongly by plant biocycling (77-73). 
The polymerization of amorphous silica in leaf 
tissues (i.e., the formation of phytoliths) and 
its return to topsoil after leaf shedding builds 
a pool of reactive silicate in soil (14). However, 
the magnitude of geochemical versus bio- 
logical processes in controlling the release 
of Si to the soil solution remains debated. 
Although soil scientists often assume that 
geochemical processes control dissolved Si 
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concentrations (75), mass-balance calculations 
suggest a strong imprint of biological pro- 
cesses (i.e., phytolith formation in plants and 
dissolution in soils) on the Si cycle (JJ, 16, 17), 
driven by the order-of-magnitude greater disso- 
lution rate of phytoliths compared with clay 
minerals (14). As soil Si is derived ultimately 
from the parent rock, plant-available Si con- 
centrations are expected to decrease with soil 
age through desilication (i.e., Si leaching during 
pedogenesis) (78), thus increasing the impor- 
tance of biological processes as soils and 
ecosystems develop (12, 13). However, the 
emergence of biological control of terrestrial 
Si cycling as soils age is still poorly under- 
stood, in part because of the limited number 
of study systems spanning sufficiently long 
time scales. 

To quantify changes in pedological and 
biological controls of Si cycling during long- 
term ecosystem development, we studied Si in 
soils and plants along a pair of 2-million-year 
coastal dune chronosequences in southwestern 
Australia (79). Such long-term chronosequences 
that have not been directly affected by Pleisto- 
cene glaciations are rare worldwide (20). The 
Jurien Bay and Guilderton chronosequences 
include the end-members of soil formation 
(8, 21), providing a rare opportunity to study 
long-term shifts in biogeochemical cycles. Soil 
development along these chronosequences 
includes carbonate leaching from Holocene 
soils [<6.5 thousand years (ka); stages 1 to 3], 
formation of secondary Si-bearing minerals 
in young Middle Pleistocene soils (~120 ka; 
stage 4) followed by their loss via dissolution 
in medium-aged and old Middle Pleistocene 
soils (~250 to 500 ka; stage 5), to yield quartz- 
rich soils of Early Pleistocene age (~2000 ka; 
stage 6) (18, 21). 

Along each chronosequence, we quantified 
the pools of reactive Si-bearing phases and 
plant-available Si in the soils and physically 
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extracted phytoliths (22). In addition, we quan- 
tified Si and major nutrients in mature leaves 
of the most abundant plants growing along 
the best-studied of the two chronosequences 
(Jurien Bay) (21, 23, 24). We used the con- 
centrations of Si and nutrients in leaves to 
indicate the degree of elemental biocycling. 
We hypothesized that, as soils aged, the pools 
of reactive and plant-available Si would be in- 
creasingly determined by recycling from phy- 
toliths. We also hypothesized that plant foliar 
Si concentrations would decrease with soil age 
owing to the loss of Si-bearing minerals and 
quartz enrichment, as do the concentrations of 
major rock-derived nutrients, such as phos- 
phorus, during long-term pedogenesis (24). 

The reactive pedogenic Si pool (poorly crys- 
talline aluminosilicates of nonbiogenic origin, 
estimated by oxalate extraction) increased 
markedly from Holocene soils (stages 1 to 3) 
to young Middle Pleistocene soils (stage 4), 
associated with the formation of clay minerals 
(18) (Fig. 1A; from <250 kg ha? to =2000 ke 
ha *). However, desilication during prolonged 
soil weathering resulted in the complete loss 
of the reactive pedogenic Si pool in the oldest 
stage of the chronosequences. 

The Si:Al ratio of alkali-reactive Si (mea- 
sured in hot 1% NagCO3) indicates the origin 
of this pool: values >5 suggest a biogenic 
origin (22). Alkali-reactive Si stocks were 
lowest in the three first stages (<1200 kg ha’) 
and had a mostly nonbiogenic origin (Si:Al 
0.5 to 1.8) (Fig. 1B), indicating a contribution of 
lithogenic and/or pedogenic minerals. Alkali- 
reactive Si increased strongly in stage 4 (3800 
to 6100 kg ha’), but the Si:Al ratio remained 
typical of lithogenic and pedogenic minerals 
(14 to 2.1). In contrast to the reactive pedo- 
genic Si pool, however, alkali-reactive Si did 
not disappear during long-term pedogenesis, 
varying between 2500 and 6300 kg ha’ in the 
most advanced stage of soil weathering and 
having Si:Al ratios >5, which indicates a bio- 
genic origin (25). 

Plant-available Si quantified by extraction in 
0.01 M CaCl, followed a pattern similar to the 
reactive pedogenic Si pool, increasing to a maxi- 
mum by stage 4 and then decreasing toward 
the oldest stage (Fig. 1C). The stocks of plant- 
available and reactive pedogenic Si were 
significantly correlated [coefficient of deter- 
mination (R) = 0.68; P < 0.01; n = 9 soil 
profiles] along both chronosequences. 

Soil phytoliths extracted physically by gravi- 
metric separation were concentrated in the 
surface soil horizon, where plant-available Si 
concentrations were also highest (22). The 
concentration of soil phytoliths was positively 
correlated with that of plant-available Si in soil 
horizons dominated by quartz minerals (Fig. 2). 
The contribution of phytoliths to plant-available 
Si was supported by dissolution features on 
phytoliths, which increased with depth in the 
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two soil profiles selected for microscopic ob- 
servations (22) (Fig. 3; from 6 to 8% in topsoil 
to 40 to 42% in deeper horizons). 

The mean foliar Si concentration of the 
10 most abundant species per plot at Jurien 
Bay increased continuously with soil age, from 
0.5 + 0.2 g Sikg ‘in stage 1 to 4.2 + 142 Sikg 
in stage 6 (Fig. 4), where Si availability was 
controlled by phytolith dissolution. By contrast, 
foliar concentrations of the major rock-derived 
plant nutrients (P, Ca, K, and Mg) followed 
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the opposite pattern, decreasing as soils aged 
(Fig. 4). Results were qualitatively similar when 
element concentrations were weighted by the 
relative canopy cover of each species in the plot, 
showing that these shifts in foliar chemistry 
represented community-level responses (22). 
The patterns of increasing foliar Si and de- 
creasing major nutrient concentrations were 
associated with changes in plant species com- 
position across the sequence, with dicot woody 
species in the Proteaceae and Dilleniaceae 
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contributing most strongly to the increase in 
foliar Si concentrations on the oldest soils (22). 
However, this pattern also occurred (except for 
K) within the few individual species that were 
sampled across multiple stages of the chrono- 
sequence (22). Together, these results suggest a 
selective advantage for plants that accumulate 
more Si on nutrient-depleted soils. 

Our results provide clear evidence for a 
shift from pedological to biological control in 
the terrestrial Si cycle during long-term soil 
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Fig. 1. Stocks of Si pools across the Jurien Bay and Guilderton chronosequences. Stocks of reactive pedogenic Si extracted with oxalate (A), alkali-reactive Si 
extracted with NasCO3 (B), and plant-available Si extracted with CaCl. (€) for the upper 50 cm of soil. In (B), the bar colors and labels indicate the depth-weighted 


mean ratio of alkali-reactive Si to alkali-reactive Al. Soil age increases with increasing chronosequence stage. 
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development. In the early and intermediate 
stages of soil development, the positive re- 
lationship between pedogenic-reactive and 
plant-available Si supports the hypothesis that 


geochemical processes drive Si availability 
(15, 18). This is consistent with the global- 
scale relationship between soil pH and plant- 
available Si (18), because soil pH is related to 
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Fig. 3. Estimation of phytolith dissolution features with depth in two soil profiles. The y axis indicates 
the pedogenic horizons. Soil depth increases from the horizon O (litter) to E2 (70 to 140 cm) for Jurien 
Bay stage 6 and from the horizon A (0 to 10 cm) to B2 (89 to 140 cm) for Guilderton stage 5A. 
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soil buffering capacity that is driven by 
weathering processes (26). However, with in- 
creasing soil age from ~120 ka to ~2000 ka 
(stages 4 to 6), the pool of pedogenic reactive 
Si disappeared entirely, while that of alkali- 
reactive Si remained large and was dominated 
by soil phytoliths returned to the soil via litter. 
Along with this shift, and despite the decrease 
of plant-available Si from stages 4 to 6 reach- 
ing among the lowest values worldwide (/8), 
the mean foliar Si concentrations of the most 
abundant plant species were highest in the 
last stage along the Jurien Bay chronosequence. 
This shows that the terrestrial Si cycling is 
sustained by strong plant retention of Si in 
highly desilicated soils. Given the abundance 
of quartz in these soils, its dissolution must 
contribute to Si availability (77). However, the 
correlation between phytoliths and plant- 
available Si for the quartz-rich horizons dem- 
onstrates that the order-of-magnitude greater 
solubility of biogenic amorphous silica com- 
pared with quartz (14) compensates for the 
lower concentration of phytoliths in driving 
plant-available Si. 

We assume a negligible dust imprint on Si 
dynamics in topsoils along the two chronose- 
quences (27), such that the increase in plant- 
available Si in the topsoil horizons of the 
intermediate and old soils supports the strong 
impact of phytolith dissolution in desilicated 
environments. Whereas P, Ca, K, and Mg are 
essential plant nutrients and associated with 
organic matter inside cells, Si precipitates to 
form prominent silica structures between cell 
walls and the lumen and in extracellular and 
intercellular spaces of leaf epidermis (28, 29). 
This implies that P, Ca, K, and Mg are released 
more readily during litter degradation (30, 37). 
Conversely, phytoliths can be preserved in the 
soil environment for months to millennia 
(17, 32) and therefore provide a long-term 
source of Si to plants (31, 33). In addition, 
unlike most nutrients, Si is not remobilized 
during leaf senescence, implying that all Si 
is returned to soil via litterfall. Our results 
thus show that the return of phytoliths to 
topsoil is a key process contributing to a 
slow-release source of Si that sustains the 
terrestrial cycle over geological time scales. 
We expect these results to be relevant broadly 
for other systems, given that the chronose- 
quences span the approximate range of soil 
ages worldwide and represent three globally 
relevant soil domains known to occur during 
long-term pedogenesis: carbonate leaching, 
formation of secondary minerals, and the 
subsequent loss of said minerals through 
dissolution and eluviation (26). 

The oldest soils of the Jurien Bay chrono- 
sequence are among the most strongly weathered 
and nutrient-depleted worldwide (21). How- 
ever, unlike the major plant nutrients for which 
foliar concentrations decreased markedly with 


3 of 4 


RESEARCH | REPORT 


Fig. 4. Mean foliar concentra- 
tions of silicon, calcium, 
magnesium, potassium, and 20% 
phosphorus of mature indivi- 
duals of the 10 most abundant 
plant species per plot along 
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increasing soil age, foliar Si concentrations 
showed the opposite pattern. The Jurien Bay 
chronosequence is characterized by strong 
turnover of plant species (34), reflecting the 
expression of selective edaphic forces acting 
on a species-rich regional flora over an eco- 
logical time scale (23). As a result, species 
adapted to older, nutrient-impoverished soils 
have low foliar concentrations of rock-derived 
nutrients (24) but accumulate Si in their 
leaves. Increases in foliar Si concentrations 
could be partly attributable to longer leaf life 
spans, because Si tends to accumulate as 
leaves age (35), and plants growing on nutrient- 
poor soils often increase nutrient-use effi- 
ciency by producing longer-lived leaves (36). 
The biological control of the Si cycle during 
ecosystem retrogression may also reflect im- 
portant Si-based plant functions (4, 5, 10). Re- 
duced herbivory through silica deposition 
(5, 9) in plants growing on the older soils 
could have adaptive value in these nutrient- 
impoverished habitats by minimizing tissue 
loss and therefore increasing the mean resi- 
dence time of nutrients and nutrient-use ef- 
ficiency (37, 38). In addition, there is mounting 
evidence that Si allows plants to withstand 
phosphorus stress (7), reflected in a high foliar 
Si concentration (8). Therefore, the mainte- 
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nance of the terrestrial Si cycle by plants in 
ecosystems undergoing retrogression suggests 
important, but overlooked, beneficial effects in 
nutrient-poor environments. 
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Structural basis for translational shutdown and 
immune evasion by the Nsp1 protein of SARS-CoV-2 


Matthias Thoms?*, Robert Buschauer™, Michael Ameismeier**, Lennart Koepke, Timo Denk’, 
Maximilian Hirschenberger’, Hanna Kratzat', Manuel Hayn?, Timur Mackens-Kiani?, Jingdong Cheng?, 
Jan H. Straub?, Christina M. Stiirzel”, Thomas Fréhlich®, Otto Berninghausen’, Thomas Becker’, 
Frank Kirchhoff2, Konstantin M. J. Sparrer?+, Roland Beckmann*+ 


Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the causative agent of the current 
coronavirus disease 2019 (COVID-19) pandemic. A major virulence factor of SARS-CoVs is the 
nonstructural protein 1 (Nsp1), which suppresses host gene expression by ribosome association. Here, 
we show that Nsp1 from SARS-CoV-2 binds to the 40S ribosomal subunit, resulting in shutdown of 
messenger RNA (mRNA) translation both in vitro and in cells. Structural analysis by cryo-electron 
microscopy of in vitro-reconstituted Nsp1-40S and various native Nsp1-40S and -80S complexes 
revealed that the Nsp1 C terminus binds to and obstructs the mRNA entry tunnel. Thereby, Nsp1 
effectively blocks retinoic acid—inducible gene |-dependent innate immune responses that would 
otherwise facilitate clearance of the infection. Thus, the structural characterization of the inhibitory 
mechanism of Nsp1 may aid structure-based drug design against SARS-CoV-2. 


oronaviruses (CoVs) are enveloped, single- 
stranded viruses with a positive-sense 
RNA genome which infect a large va- 
riety of vertebrate animal species. Cur- 
rently, seven CoV species from two genera 
(Alphacoronavirus and Betacoronavirus) are 
known human pathogens, four of which usu- 
ally cause only mild respiratory diseases like 
common colds (J-5). Over the last two dec- 
ades, however, three betacoronaviruses (beta- 
CoVs)—the severe acute respiratory syndrome 
coronavirus (SARS-CoV), the Middle East res- 
piratory syndrome coronavirus (MERS-CoV), 
and the severe acute respiratory syndrome 
coronavirus 2 (SARS-CoV-2)—have emerged 
as the causative agents of epidemic and, in 
the case of SARS-CoV-2, pandemic outbreaks 
of highly pathogenic respiratory diseases. The 
disease caused by SARS-CoV-2, coronavirus 
disease 2019 (COVID-19), has affected millions 
of people, with a death toll amounting to 
hundreds of thousands worldwide (6, 7). 
Coronavirus particles contain a single, 5’- 
capped and 3’-polyadenylated RNA genome 
which codes for two large overlapping open 
reading frames in gene 1 (ORFla and ORF'Ib), 
as well as a variety of structural and non- 
structural proteins at the 3’ end (8, 9). After 
host infection, precursor proteins ORFla and 
ORF iab are translated and subsequently pro- 
teolytically cleaved into functional proteins, 
most of which play roles during viral replication 
(0). Among these proteins is the N-terminal 
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nonstructural protein 1 (Nsp1). Despite differ- 
ences in protein size and mode of action, Nsp1 
proteins from alpha- and beta-CoVs display a 
similar biological function in suppressing host 
gene expression (1/-14). SARS-CoV Nsp1 in- 
duces a near-complete shutdown of host pro- 
tein translation by a two-pronged strategy: 
first, it binds the small ribosomal subunit and 
stalls canonical mRNA translation at various 
stages during initiation (15, 16). Second, Nsp1 
binding to the ribosome leads to endonucleo- 
lytic cleavage and subsequent degradation of 
host mRNAs. Notably, interactions between 
Nsp1 and a conserved region in the 5’ un- 
translated region (UTR) of viral mRNA prevent 
shutdown of viral protein expression through 
an unknown mechanism (/7). Taken together, 
Nsp1 inhibits all cellular antiviral defense 
mechanisms that depend on the expression 
of host factors, including the interferon re- 
sponse. This shutdown of the key parts of the 
innate immune system may facilitate efficient 
viral replication (73, 18) and immune evasion. 
Its central role in weakening the antiviral im- 
mune response makes SARS-CoV Nsp1 a poten- 
tial therapeutic target (19, 20). Here, we set out 
to characterize the interaction of Nsp1 of SARS- 
CoV-2 with the human translation machinery. 
Nsp1 of SARS-CoV-2 shows 84% amino acid 
sequence identity with SARS-CoV, suggesting 
similar properties and biological functions 
(Fig. 1A). The C-terminal residues Lys!“ (K164) 
and His'® (H165) in SARS-CoV are conserved 
in beta-CoVs and essential for 40S interaction, 
as mutations to alanine abolish 40S binding 
and relieve translational inhibition (16). To 
confirm an analogous function of Nsp1 from 
SARS-CoV-2, we expressed and purified re- 
combinant Nsp1 and the K164—Ala (K164A) 
H165—Ala (H165A) mutant (Nsp1-mt) of both 
SARS-CoV and SARS-CoV-2 in Escherichia coli 
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and tested their binding efficiencies to puri- 
fied human ribosomal subunits (Fig. 1B and 
fig. S1A). Nsp1 from both CoVs associated 
strongly with 40S subunits but not with 60S 
subunits, whereas both Nsp1-mt constructs 
showed no binding (Fig. 1B). Thus, ribosome 
binding to the 40S subunit is preserved and 
residues K164 and H165 of Nsp1 from both 
SARS-CoVs are important for this ribosome 
interaction. To further verify this, we expressed 
wild-type or mutant Nsp1 constructs in human 
embryonic kidney (HEK) 293T cells and ana- 
lyzed ribosome association by sucrose gradient 
centrifugation. Consistent with the behavior in 
vitro, Nsp1 of CoV and CoV-2 co-migrated with 
40S ribosomal subunits and 80S ribosomes, 
but not with actively translating polyribosomes. 
In contrast, the mutant constructs barely pen- 
etrated the gradient, indicative of their loss of 
affinity for ribosomes (Fig. 1C). Compared with 
the control, the polysome profiles showed a 
shift from translating polyribosomes to 80S 
monosomes in the presence of Nsp1, indicat- 
ing global inhibition of translation. This ef- 
fect was less pronounced for the two Nsp1-mt 
constructs. Next, we performed in vitro trans- 
lation assays of capped reporter mRNA in cell- 
free translation extracts from human cells 
(HeLa S3) or rabbit reticulocytes in the pres- 
ence of Nsp1 or Nsp1-mt. Probing for the trans- 
lation products by Western blotting revealed a 
complete inhibition of translation by Nsp1 and 
only weak effects in the presence of Nsp1-mt 
constructs (Fig. 1D and fig. SIB). To test the 
inhibitory effect of Nsp1 on translation in cells, 
we expressed 3xFLAG-tagged Nsp1 of SARS- 
CoV-2 and SARS-CoV and their respective 
mutants in HEK293T cells and monitored trans- 
lation of a cotransfected capped luciferase 
reporter mRNA. Consistent with the results of 
the in vitro assays, we observed a strong re- 
duction of translation in the presence of Nsp1 
from SCoV-1 or -2, but not of the respective 
Nsp1-mt constructs (Fig. 1E). This phenotype 
was confirmed for differently tagged (V5) and 
codon-optimized versions of SCoV-2 Nsp! (fig. 
S1, C and D). Nsp7, which is derived from the 
same polyprotein precursor as Nspl, had no 
effect on translation (fig. S1C). In summary, 
Nsp1 from both SARS-CoV and SARS-CoV-2 
binds 40S and 80S ribosomes and disrupts 
cap-dependent translation. Moreover, the con- 
served KH motif close to the C terminus of 
Nsp1 is crucial for ribosome binding and trans- 
lation inhibition. 

To elucidate the molecular interaction of 
SARS-CoV-2 Nsp1 with human ribosomes, we 
reconstituted a complex from purified, re- 
combinant Nsp1 and purified human 40S 
ribosomal subunits and determined its struc- 
ture by cryo-electron microscopy (cryo-EM) 
at an average resolution of 2.6 A (Fig. 2, A 
and B; and figs. S2 and S3). In addition to the 
40S ribosomal subunit, we observed density 
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Fig. 1. Nsp1 interacts with 40S ribosomal subunits and inhibits 
translation. (A) Domain organization of Nspl and sequence alignment 

of the C-terminal segment (red line) of Nsp1 from seven human CoVs. 

The KH motif is marked. (B) In vitro binding assay of GST-TEV (GST)-tagged 
Nspl and Nspl-mt from SARS-CoV-1 (SCoV-1) and SCoV-2 with human 40S 
and 60S ribosomal subunits. A Coomassie-stained SDS-polyacrylamide gel 


electrophoresis (PAGE) gel for inputs and eluates 
S-transferase; MW, molecular weight markers. (C) 


analysis of HEK293T lysate (control) and lysate from HEK293T cells 


transiently transfected with 3xFLAG-tagged Nspl 
from SCoV-1 and SCoV-2 and Western blot analys 


corresponding to two a helices inside the 
ribosomal mRNA entry channel which could 
be unambiguously identified as the C-terminal 
part of Nsp1 from SARS-CoV-2 (Fig. 2C). In 
proximity to the helical density, we observed 
undefined globular density between ribosomal 
RNA (rRNA) helix h16 and ribosomal proteins 
uS3 and uS10. The dimensions of this ex- 
tra density roughly matched the putative di- 
mensions of the globular N-terminal domain 
of Nsp1 (Fig. 2, B and D), on the basis of the 
structure of the highly similar N terminus 
of Nsp1 from SARS-CoV, previously deter- 
mined by nuclear magnetic resonance (27). 
However, the resolution of this region in our 
cryo-EM density map was insufficient for 
unambiguous identification. The C termi- 
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is shown. GST, glutathione 
Polyribosome gradient 
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is (anti-FLAG antibody; 


nus of Nsp1 is located close to the so-called 
“latch” between rRNA helix h18 of the body 
and h34 of the head of the 40S subunit, 
which influences mRNA accommodation 
and movement during translation initia- 
tion (22, 23). When bound at this position, 
the Nsp1 C terminus blocks regular mRNA 
accommodation, thus providing an explana- 
tion for Nsp1-mediated host translation shut- 
down (Fig. 2D). 

To characterize the ribosomal targets and 
the mode of interaction of Nsp1 in human 
cells, we expressed N-terminally 3xFLAG- 
tagged Nsp1 in HEK293T cells and affinity 
purified associated native complexes for anal- 
ysis by cryo-EM and mass spectrometry (Fig. 


separate blots). (D) Western blot (top, anti-V5 antibody) and 
(bottom) of cell-free in vitro translation of a capped reporter 
reticulocytes (RRL) and HeLa S3 lysate. Controls 1 and 2, wi 
capped reporter mRNA, respectively. A Coomassie-stained SDS-PAGE gel of 
the applied (His).—-tobacco etch virus (Hisg)-tagged Nsp1 constructs is shown 
below. (E) Quantification of luciferase activity in HEK293T cel 
with indicated 3xFLAG-tagged proteins and in vitro—transcribed firefly luciferase 
mRNA. Bars represent means + SEM (n = 6 samples). RLU, 
Representative immunoblots of whole-cell lysates (WCL) stained with anti-FLAG 
and anti-glyceraldehyde-3-phosphate dehydrogenase (GAPDH). **P < 0.001 
[unpaired Student's t test (Welch correction)]. 


2E, figs. S2 and S3, and data S1). Structural 
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SDS-PAGE analysis 
mRNA with rabbit 
h and without 


Is transfected 


elative light units. 


analysis revealed 40S and 80S ribosomal com- 
plexes in nine compositionally different states 
(Fig. 2, F to N). All of them displayed density 
for the Nsp1 C terminus in an identical posi- 
tion and conformation observed in the in vitro- 
assembled complex, and all complexes lacked 
density corresponding to mRNA. 

The Nsp1-bound 40S ribosomal complexes 
could be divided into three major populations. 
The first represents idle Nsp1-40S complexes 
(Fig. 2F), essentially resembling the in vitro- 
reconstituted complex. The second population 
comprises unusual, pre-40S-like complexes 
(Fig. 2, G and H), in which the cytosolic ri- 
bosome biogenesis factor TSR1 is bound in 
two distinct conformations between the 40S 
head and body (24, 25). These complexes do 
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Fig. 2. Cryo-EM structures of Nsp1-bound ribosomal complexes. (A) SDS- 
PAGE analysis of reconstituted Nspl-40S complexes. Nspl is labeled with an 
asterisk. MW, molecular weight markers. (B) Reconstituted Nsp1-40S structure 
with Nspl shown in pink; rRNA and proteins are shown in yellow. Additional 
density between uS3 and hl6 assigned to the N-terminal fold of Nspl is shown. 
bk, beak; pf, platform; If, left foot; rf, right foot. (©) C-terminal helix 1 and 2 of 
Nspl with corresponding densities. (D) Cross-section of the 40S, highlighting the 


central position of Nsp1 within the mRNA tunnel. The putative position of the 
N-terminal domain of Nsp1 is schematically indicated [models are based on 
PDB-2HSX (21) and PDB-6Y0G (47)]. (E) SDS-PAGE analysis of Nsp1-ribosomal 
complexes affinity purified from HEK293T cells. Proteins identified in the 
cryo-EM structures were labeled according to mass spectrometry analysis 
(data S1). (F to N) Cryo-EM maps of affinity-purified Nspl-ribosomal complexes. 
Additional factors are colored and labeled accordingly. 


not resemble any known on-pathway bio- 
genesis intermediates. The third population 
represents eukaryotic initiation factor 3 (eIF3)- 
containing 43S preinitiation complexes (PICs) 
and could be further divided into PICs with 
and without eIF1A, eIF1, and a fully assembled 
eIF2-tRNA;-guanosine triphosphate (GTP) 
complex (Fig. 2, I and J) (26-28). Both PICs 
adopt the previously observed open confor- 
mation (28). The stable association of Nsp1 in 
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the cell with multiple different intermediate 
states of translation initiation besides empty 
40S ribosomal complexes is in agreement 
with the proposed role of Nsp1 as an inhibitor 
of translation initiation (15). 

The Nsp1-bound 80S complexes could be 
divided into two major populations of trans- 
lationally inactive ribosomes. The first popu- 
lation (Fig. 2, K and L, and fig. $4, A to E) 
contained the protein coiled-coil domain 
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containing short open reading frame 124 
(CCDC124), a homolog of the ribosome pro- 
tection and translation recovery factor Lso2 
in Saccharomyces cerevisiae (29). A similar 
complex of inactive 80S ribosomes bound 
to CCDC124 was recently described (30). In 
addition to the known hibernation complex, 
a subpopulation of the CCDC124-bound 80S 
contained also the ribosome recycling factor 
and ABC-type ATPase ABCE1 (31-33) and the 
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Fig. 3. Molecular basis of Nsp1 ribosome interaction and inhibition. (A) Cryo-EM 
map of in vitro-reconstituted Nsp1-40S and segmented density of Nspl-C, uS3 
(residues 97 to 153 and 168 to 189), uS5 (102 to 164), and rRNA helix h18 with the 
corresponding models. Interacting residues are shown as sticks. (B) Nspl-C surface, 
colored by electrostatic potential from —5 (red) to +5 (blue). (€) Model of Nsp1-C 
and surface representation of the models of uS3 (residues 97 to 153 and 168 to 189), 
uS5 (102 to 164), and rRNA helix h18. Molecular interactions between Nspl and the 
ribosome are shown. (D) mRNA entry channel; 40S head is removed. Nsp1-C 
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occupies the mRNA path [PDB-6Y0G (47)]. (E) K164 and H165 of Nspl bind to a 
pocket on h18. (F) R171 and R175 of Nspl bind to the phosphate backbone of h18. 
(G) Negatively charged residues D152, E155, and £159 of al interact with uS3. (H) The 
hydrophobic interface of al and a2 binds to a hydrophobic patch on uS6. (I) Schematic 
summary of the interaction of Nsp1-C with uS3, uS5, and h18; residues belonging to 
al and a2 are colored pink. A, adenine; C, cytosine; D, aspartic acid; E, glutamic acid; 
F, phenylalanine; G, guanine; H, histidine; |, isoleucine; K, lysine; L, leucine; M, 
methionine; R, arginine; T, threonine; U, uracil; V, valine; W, tryptophan; Y, tyrosine. 


class I translation termination factor eRF1 in 
an unusual conformation (fig. $4, C to E). The 
previously unresolved, flexible C-terminal part 
of CCDC124 was stably bound to the ribosomal 
A-site in this complex. This subpopulation might 
represent a previously unidentified ribosome 
recycling-like state. 
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The second major population of Nsp1-bound 
80S ribosomes (Fig. 2, M and N) lacked 
CCDC124 but contained the cell growth- 
regulating nucleolar protein Ly 1 antibody 
reactive (LYAR), which has been implicated 
in processing of pre-rRNA and in negative reg- 
ulation of antiviral innate immune responses 
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(34, 35). We found the C terminus of LYAR 
occupying the ribosomal A-site, similar to 
CCDC124 (Fig. 2M and fig. S4, F and G). 
Furthermore, we identified a subpopulation 
among the LYAR-bound inactive 80S ribo- 
somes that contained a ternary eEFIA-GTP- 
tRNA complex (Fig. 2N and fig. S4, H to K). 
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Fig. 4. Inhibition of the innate immune response by SARS-CoV-2 Nsp1. 

(A) Quantification of IFN-B promoter-—controlled firefly luciferase activity 

in HEK293T cells transiently expressing 3xFLAG-tagged or nontagged (RV P) 
proteins. Cells were infected with SeV or left uninfected. Representative 
immunoblots of whole-cell lysates (WCLs) stained with anti-RV P, anti-FLAG, and 
anti-GAPDH are shown (bottom panel). (B) Enzyme-linked immunosorbent assay 
results for IFN-B, IFN-Al, and IL-8 in the supernatant of HEK293T cells transiently 
expressing 3xFLAG-tagged proteins and infected with SeV (top panel) for 24 hours. 
Quantitative polymerase chain reaction (qPCR) results for corresponding mRNAs 
are shown in the bottom panel. (C and D) Quantification of ISRE promoter- 


This ternary complex was in an unusual con- 
formation, with the anticodon loop contact- 
ing an a helix of the LYAR C terminus. Such a 
complex has not been previously described 
and its functional relevance is unknown. 
Taken together, we found Nsp1 bound 
to the mRNA entry channel of a distinct 
set of translationally inactive 80S ribosomes, 
among which were unusual complexes. It 
is unclear whether these are a result of the 
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presence of Nsp1 or whether they occur nat- 
urally and have an increased affinity for Nsp1 
due to their distinct conformation or lack 
of mRNA. 

All observed ribosomal complexes displayed 
the same binding mode of Nsp1 to the 40S 
subunit, in which the C-terminal domain of 
Nsp1 (NspI1-C) is rigidly bound inside the 
mRNA entry channel. Here, it interacts with 
the rRNA helix h18, with the ribosomal pro- 
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controlled firefly luciferase activity in 
3xFLAG-tagged proteins in single amounts (C) or increasing amounts (D) and 
treated with 1000 U/ml IFN-B as indicated. Representative immunoblots of 
WCLs stained with anti-FLAG and anti-GAPDH are shown in the bottom panels. 
(E) Representative immunoblots and 
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HEK293T cells transiently expressing 


quantification of WCLs of HEK293T cells 


IFN-B and stained for endogenous RIG-I, |SG15, and 


GAPDH. qPCR results for the corresponding mRNAs are shown in the bottom 
two panels. In (A), (C), and (D), bars 
and (E), bars represent means + SE 


represent means + SEM of six samples; in (B) 
of three samples. ns, not significant; 
[unpaired Student's t test (Welch correction)]. 


tein uS5 of the 40S body, and with uS3 of 
the 40S head. The local resolution of 2.6 A 
(fig. S3) allowed for a detailed analysis of 
the molecular interactions of Nsp1 with the 
ribosome (Fig. 3A). 

The shorter, first a helix of Nsp1-C (a1; res- 
idues 154 to 160) interacts with uS3 and uS5. 
The helix is followed by a short loop, which 
contains the essential KH motif that inter- 
acts with h18. This part of h18 belongs to the 
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so-called 530-loop, which actively participates 
in ribosomal decoding and has been reported 
to resemble a conserved structural motif in the 
3'UTR of beta-CoVs (36). The second, larger o 
helix of Nsp1-C (a2; residues 166 to 179) also 
interacts with rRNA h18 and connects back 
to uS5 at its C-terminal end. The two helices 
stabilize each other through hydrophobic in- 
teractions. The electrostatic potential on the 
Nsp1-C surface displays three major patches 
(Fig. 3B): a negatively charged patch on al 
facing positively charged residues on uS3; a 
positively charged patch on a2 facing the 
phosphate backbone of h18; and a hydro- 
phobic patch at the o1-a2 interface which is 
exposed to hydrophobic residues on uS5. In 
addition to the matching surface charge, 
the shape of Nsp1-C matches the shape of the 
mRNA channel and completely overlaps the 
regular mRNA path (Fig. 3, C and D). To- 
gether, this explains the strong inhibitory ef- 
fect on translation observed in vitro and in vivo. 
A key interaction is established through the 
KH motif, which binds to a distinct site on 
rRNA helix h18 (Fig. 3, C and E); K164 of Nsp1 
inserts into a negatively charged pocket, con- 
stituted mainly of the phosphate backbone of 
rRNA bases G625 and U630, whereas H165 
stacks in between U607 and U630. The base 
U630 is stabilized in this position through 
interaction with the backbone of G168 of 
Nspl. Further interactions involve R171 and 
R175 of Nsp1, which form salt bridges to the 
backbone phosphates of G601, C607, A605, 
and G606 of h18 (Fig. 3F). The interactions 
of Nsp1-C and uS3 are established through 
salt bridges and hydrogen bonds between 
D152, E155, and E159 of Nsp1 and R116, R143, 
and M150 of uS3 (Fig. 3G). The interactions 
of Nsp1-C with uS5 occur within a hydropho- 
bic surface of ~440 A’ and involve residues 
Y154, F157, W161, T170, L173, M174, and L177 
of Nsp1 and residues V106, 1109, P111, T122, 
F124, V147, and 1151 of uS5 (Fig. 3H). Taken 
together, specific molecular contacts (summa- 
rized in Fig. 31) rigidly anchor Nsp1 and thereby 
obstruct the mRNA entry channel. 

Type I interferon (IFN) induction and sig- 
naling represents one of the major innate 
antiviral defense pathways, ultimately leading 
to the induction of several hundred antiviral 
IFN-stimulated genes (ISGs) (37). Corona- 
virus infections are sensed by retinoic acid- 
inducible gene I (RIG-D, which activates this 
defense system (37, 38). To assess the effects 
of SARS-CoV-2 Nsp1 on the IFN system, we 
stimulated HEK293T cells with Sendai virus 
(SeV), a well-known trigger of RIG-I-dependent 
signaling (39, 40). Expression of Nsp1 com- 
pletely abrogated the translation of firefly 
luciferase controlled by the human IFN-B 
promoter, whereas the Nsp1-mt had no sig- 
nificant effect (Fig. 4A and fig. S5A), confirm- 
ing the results of the in vitro translation assays. 
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Rabies virus P protein (RV P) (42) and SARS- 
CoV-2 Nsp7 were used as positive and negative 
controls, respectively. After stimulation with 
SeV, the protein levels of endogenous IFN-B, 
IFN-AI, and interleukin-8 (IL-8) (Fig. 4B and 
fig. S5, B and C) in the supernatant of Nsp1- 
expressing cells were drastically reduced, al- 
though transcription of the corresponding 
mRNAs was induced. Again, Nsp1-mt showed 
no inhibitory effect. Expression of luciferase 
driven by the IFN-stimulated response ele- 
ment (ISRE), which is part of the promoter of 
most ISGs, was effectively shut down by Nsp1, 
but not by Nsp1-mt, in a dose-dependent man- 
ner (Fig. 4, C and D, and fig. S5D). SARS-CoV-2 
Nsp7 and measles virus V protein (MeV V) 
(40, 42) served as negative and positive con- 
trols, respectively. In line with these findings, 
Nsp1 but not Nsp1-mt suppressed the induc- 
tion of endogenous RIG-I and ISG15 upon 
IFN-f stimulation on the protein but not the 
mRNA level (Fig. 4E). 

Not all innate immune responses require 
active translation for function. For example, 
autophagy is barely affected by the expression 
of Nsp1 or its mutant (fig. SSE), even upon 
induction with rapamycin (43). Tripartite 
motif protein 32 (TRIM32) was used as a 
positive control (44). Taken together, these 
results demonstrate that SARS-CoV-2 Nsp1 
almost completely prevents translation not 
only of IFNs and other proinflammatory cy- 
tokines but also of IFN-stimulated anti- 
viral ISGs. 

Our data establish that one of the major 
immune evasion factors of SARS-CoV-2, Nsp1, 
efficiently interferes with the cellular translation 
machinery, resulting in a shutdown of host 
protein production. Thus, major parts of the 
innate immune system that depend on trans- 
lation of antiviral defense factors such as IFN-B 
or RIG-I (45) are disarmed. Although SARS- 
CoV-2 encodes additional potential inhibitors 
of the innate immune defenses, a loss of Nsp1 
function may render the virus vulnerable 
toward immune clearance. Thus, our data may 
provide a starting point for rational structure- 
based drug design targeting the Nsp1-ribosome 
interaction. 

However, important questions remain to 
be addressed. For example, how can the virus 
overcome the Nsp1-mediated block of trans- 
lation for the production of its own viral 
proteins? Common structural features present 
in the 5'UTR of all SARS-CoV mRNAs may 
help to circumvent the ribosome blockage by 
Nsp1 (46). 
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Brazil currently has one of the fastest-growing severe acute respiratory syndrome coronavirus 2 
(SARS-CoV-2) epidemics in the world. Because of limited available data, assessments of the 
impact of nonpharmaceutical interventions (NPIs) on this virus spread remain challenging. Using a 
mobility-driven transmission model, we show that NPls reduced the reproduction number from >3 to 
1 to 1.6 in Sao Paulo and Rio de Janeiro. Sequencing of 427 new genomes and analysis of a 
geographically representative genomic dataset identified >100 international virus introductions in 
Brazil. We estimate that most (76%) of the Brazilian strains fell in three clades that were introduced 
from Europe between 22 February and 11 March 2020. During the early epidemic phase, we found 
that SARS-CoV-2 spread mostly locally and within state borders. After this period, despite sharp 
decreases in air travel, we estimated multiple exportations from large urban centers that coincided 
with a 25% increase in average traveled distances in national flights. This study sheds new light on 


the epidemic transmission and evolutionary trajectories of SARS-CoV-2 lineages in Brazil and 
provides evidence that current interventions remain insufficient to keep virus transmission under 


control in this country. 


evere acute respiratory syndrome coro- 

navirus 2 (SARS-CoV-2) is a novel beta- 

coronavirus with a 30-kb genome that 

was first reported in December 2019 

in Wuhan, China (J, 2). SARS-CoV-2 
was declared a public health emergency of 
international concern on 30 January 2020. 
As of 12 July 2020, there were >12.5 million 
cases of coronavirus disease 2019 (COVID- 
19) and 561,000 deaths globally (3). The virus 
can be classified into two main phyloge- 
netic lineages, A and B, which spread from 
Wuhan before strict travel restrictions were 
enacted (4, 5) and now cocirculate around 
the world (6). The case fatality ratio of SARS- 
CoV-2 infection has been estimated at be- 
tween 1.2 and 1.6% (7-9), with substantially 
higher ratios in those >60 years of age (8). 
Some estimates suggest that 18 to 56% of 
SARS-CoV-2 transmission is from asymp- 
tomatic or presymptomatic individuals 
(10-13), complicating epidemiological assess- 
ments and public health efforts to curb the 
pandemic. 
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Challenges of real-time assessment 

of transmission 

Although the SARS-CoV-2 epidemics in several 
countries, including China, Italy, and Spain, 
have been brought under control through non- 
pharmaceutical interventions (NPIs) (3), the 
number of SARS-CoV-2 cases and deaths in 
Brazil continues to increase (/4) (Fig. 1A). As of 
12 July 2020, Brazil had reported 1,800,827 
SARS-CoV-2 cases, the second-largest num- 
ber in the world, and 70,398 deaths. More 
than one-third of the cases (34%) in Brazil 
are concentrated in the southeast region, 
which includes Sao Paulo city (Fig. 1B), the 
world’s fourth-largest conurbation, where the 
first case in Latin America was reported on 
25 February 2020 (15). Diagnostic assays for 
SARS-CoV-2 molecular detection were widely 
distributed across the regional reference cen- 
ters of the national public health laboratory 
network from 21 February 2020 on (6, 17). 
However, several factors, including delays in 
reporting, changes in notification, and heter- 
ogeneous access to testing across populations, 
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obfuscate the real-time assessment of virus 
transmission using SARS-CoV-2 case counts 
(15). Consequently, a more accurate measure 
of SARS-CoV-2 transmission in Brazil is the 
number of reported deaths caused by severe 
acute respiratory infections (SARIs), which 
is provided by the Sistema Unico de Satide 
(SUS) (8). Changes in the opportunity for 
SARS-CoV-2 transmission are strongly asso- 
ciated with changes in average mobility (J8—20) 
and can typically be measured by calculating 
the effective reproduction number, R, defined 
as the average number of secondary infections 
caused by an infected person. R > 1 indicates a 
growing epidemic, whereas R < 1 is needed to 
achieve a decrease in transmission. 

We used a Bayesian semimechanistic model 
(21, 22) to analyze SARI mortality statistics 
and human mobility data to estimate daily 
changes in R in Sao Paulo city (12.2 million 
inhabitants) and Rio de Janeiro city (6.7 mil- 
lion inhabitants), the largest urban metro- 
poles in Brazil (Fig. 1, C and D). NPIs in Brazil 
consisted of school closures implemented be- 
tween 12 and 23 March 2020 across the 
country’s 27 federal units/states and store 
closures implemented between 13 and 23 March 
2020. In Sao Paulo city, schools started closing 
on 16 March 2020 and stores closed 4 days later. 
At the start of the epidemics, we found R > 3 in 
Sao Paulo and Rio de Janeiro and, concurrent 
with the timing of state-mandated NPIs, R 
values fell close to 1. 


Mobility-driven changes in R 


Analysis of R values after NPI implementation 
highlights several notable mobility-driven fea- 
tures. There was a period immediately after 
NPIs, between 21 and 31 March 2020, when 
R was consistently <1 in Sao Paulo city (Fig. 
1C). However, after this initial decrease, the R 
value for SAo Paulo rose to >1 and increased 
through time, a trend associated with increased 
population mobility. This can be seen in the 
Google transit stations index, which rose from 
-60 to -52%, and by a decrease in the social 
isolation index from 54 to 47%. By 4 May 2020, 
we estimate R = 1.3 [95% Bayesian credible 
interval (BCT): 1.0 to 1.6] in both Sao Paulo and 
Rio de Janeiro cities (table S1). However, we 
note that there were instances in the previous 
7 days when the 95% credible intervals for R 
included values <1, drawing attention to the 
fluctuations and uncertainty in the estimated 
R for both cities. 

Early sharing of genomic sequences, includ- 
ing the first SARS-CoV-2 genome, Wuhan- 
Hu-1, released on 10 January (23), has enabled 
unprecedented global levels of molecular test- 
ing for an emerging virus (24, 25). However, 
despite the thousands of virus genomes de- 
posited on public access databases, there is a 
lack of consistent sampling structure and there 
are limited data from Brazil (26-28), which 
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hampers accurate reconstructions of virus 
movement and transmission using phylo- 
genetic analyses. To investigate how SARS- 
CoV-2 became established in the country, 
and to quantify the impact of NPIs on virus 


A 


No. SARS-CoV-2 cases 
per day (log, thousands) 


Sao Paulo city 


Reproduction number, R 


2 Mar 


Fig. 1. SARS-CoV-2 epidemiology and epidemic spread in Brazil. 

(A) Cumulative number of SARS-CoV-2 reported cases (blue) and deaths 
(gray) in Brazil. (B) States are colored according to the number of cumulative 
confirmed cases by 30 April 2020. (€ and D) R over time for the cities of 
Sao Paulo (C) and Rio de Janeiro (D). R values were estimated using a 
Bayesian approach incorporating the daily number of deaths and four 
variables related to mobility data (a social isolation,index from Brazilian 


spatiotemporal spread, we tested a total of 
26,732 samples from public and private lab- 
oratories using real-time quantitative poly- 
merase chain reaction (RT-qPCR) assays and 
found 7944 (29%) to be positive for SARS- 
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CoV-2. We then focused our sequencing ef- 
forts on generating a large and spatially rep- 
resentative genomic dataset with curated 
metadata to maximize the association be- 
tween the number of sequences and the 
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geolocation company InLoco and Google mobility indices for time spent 

in transit stations, parks, and the average between groceries and pharmacies, 
retail and recreational, and workspaces). Dashed horizontal line indicates 

R = 1. Gray area and geometric symbols show the times at which NPls 
were implemented. BCls of 50 and 95% are shown as shaded areas. The 
two-letter ISO 3166-1 codes for the 27 federal units in Brazil are provided in 
the supplementary materials. 
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number of SARS-CoV-2 confirmed cases per 
state. 


Spatially representative sequencing efforts 


We generated 427 new SARS-CoV-2 genomes 
with >75% genome coverage from Brazilian 
samples collected between 5 March and 
30 April 2020 (figs. S1 to S3 and data S1). For 
each state, the time between the date of the 
first reported case and the collection date of the 
first sequence analyzed in that state was only 
4.5 days on average (Fig. 2A). For eight federal 
states, genomes were obtained from samples 
collected up to 6 days before the first case no- 
tifications. The genomes generated here were 


A 


collected in 85 municipalities across 18 of 
27 federal units spanning all regions in Brazil 
(Fig. 2A and fig. S2). Sequenced genomes were 
obtained from samples collected 4 days on 
average (median, range: 0 to 29 days) after 
the onset of symptoms and were generated 
in three laboratories using harmonized se- 
quencing and bioinformatic protocols (table 
$2). When we include 63 additional available 
sequences from Brazil deposited in GISAID 
(29) (see data S1 and S2), we found the dataset 
to be representative of the spatial heterogeneity 
of the Brazilian epidemic. Specifically, the num- 
ber of genomes per state strongly correlated 
with SARI SARS-CoV-2 confirmed cases and 
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Fig. 2. Spatially representative genomic sampling. (A) Dumbbell plot showing the time intervals between 
date of collection of sampled genomes, notification of first cases, and first deaths in each state. Red lines 
indicate the lag between the date of collection of first genome sequence and first reported case. The key for 
the two-letter ISO 3166-1 codes for Brazilian federal units (or states) are provided in the supplementary 
materials. (B) Spearman's rank correlation between the number of SARI SARS-CoV-2 confirmed and SARI 
cases with unknown etiology against the number of sequences for each of the 21 Brazilian states included 
in this study (see also fig. S4). Circle sizes are proportional to the number of sequences for each federal unit. 
(C) Interval between the date of symptom onset and the date of sample collection for the sequences 


generated in this study. 
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SARI cases with unknown etiology per state 
(n = 490 sequences from 21 states, Spearman’s 
correlation, p = 0.83; Fig. 2A). This correlation 
varied from 0.70 to 0.83 when considering 
SARI cases and deaths caused by SARS-CoV-2 
and SARI cases and deaths from unknown 
etiology (fig. S4). Most (n = 485/490) Brazilian 
sequences belong to SARS-CoV-2 lineage B, 
with only five strains belonging to lineage A 
(two from Amazonas, one from Rio Grande 
do Sul, one from Minas Gerais, and one from 
Rio de Janeiro; data S1 and fig. S5 show de- 
tailed lineage information for each sequence). 
Moreover, we used an in silico assessment of 
diagnostic assay specificity for Brazilian strains 
(n = 490) to identify potential mismatches in 
some assays targeting these strains. We found 
that the forward primers of the Chinese CDC 
and Hong Kong University nucleoprotein- 
targeting RT-qPCR may be less appropriate for 
use in Brazil than other diagnostic assays, for 
which few or no mismatches were identified 
(fig. S6 and table S3). The impact of these 
mismatches on the sensitivity of these assays 
should be confirmed experimentally. If sen- 
sitivity is affected, then the use of duplex 
RT-qPCR assays that concurrently target dif- 
ferent genomic regions may help in the detec- 
tion of viruses with variants in primer- or 
probe-binding regions. 


Phylogenetic analyses and 
international introductions 


We estimated maximum likelihood and mo- 
lecular clock phylogenies for a global dataset 
with a total of 1182 genomes sampled from 
24 December 2019 to 30 April 2020 (root-to- 
tip genetic distance correlation with sampling 
dates, 7” = 0.53; Fig. 3A and fig. $7). We in- 
ferred a median evolutionary rate of 1.13 x 1077 
(95% BCI: 1.03 to 1.23 x 10-?) substitutions per 
site per year using an exponential growth co- 
alescent model, equating to 33 changes per 
year on average across the virus genome. This 
is within the range of evolutionary rates esti- 
mated for other human coronaviruses (30-33). 
We estimate the date of the common ances- 
tor (TMRCA) of the SARS-CoV-2 pandemic 
to around mid-November 2019 (median = 
19 November 2019, 95% BCI: 26 October 2019 
to 6 December 2019), which is consistent with 
recent findings (34, 35). 

Phylogenetic analysis revealed that the ma- 
jority of the Brazilian genomes (76%, 7 = 370/ 
490) fell into three clades, hereafter referred 
to as Clade 1 (n = 186/490, 38% of Brazilian 
strains), Clade 2 (n = 166, 34%), and Clade 3 
(n = 18/490, 4%) (Fig. 3A and figs. S8 and S9), 
which were largely in agreement with those 
identified in a phylogenetic analysis using 
13,833 global genomes. The most recent com- 
mon ancestors of the three main Brazilian clades 
(Clades 1 to 3) were dated from 28 February 
(21 February to 4 March 2020) (Clade 1), 
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Fig. 3. Evolution and spread of SARS-CoV-2 in Brazil. (A) Time-resolved 
maximum clade credibility phylogeny of 1182 SARS-CoV-2 sequences, 490 of which 
are from Brazil (salmon) and 692 from outside of Brazil (blue). The largest Brazilian 
clades are highlighted by gray boxes (Clade 1, Clade 2, and Clade 3). Inset shows a 
root-to-tip regression of genetic divergence against dates of sample collection. Red 
tip corresponds to the first reported case in Brazil. (B) Dynamics of SARS-CoV-2 
import events in Brazil. Dates of international and national (between federal states) 


22 February (17 to 24 February 2020) (Clade 2), 
to 11 March (9 to 12 March 2020) (Clade 3) 
(Fig. 3A and fig. S10). This indicates that 
community-driven transmission was already 
established in Brazil by early March, suggest- 
ing that international travel restrictions ini- 
tiated after this period would have had limited 
impact. Brazilian Clade 1 is characterized by 
a nucleotide substitution in the spike protein 
(G25088T, numbering relative to GenBank 
reference NC_045512.2) and circulates pre- 
dominantly in Sao Paulo state (nm = 159, 85.4%; 
figs. S9 and S11). Clade 2 is defined by two 
nucleotide substitutions in ORF6 (T27299C) 
and nucleoprotein (T29148C); this is the most 
spatially widespread lineage, with sequences 
from a total of 16 states in Brazil. Clade 3 is 
concentrated in Ceara state (n = 16, 89%) and 
falls in a global cluster with sequences mainly 
from Europe. In the Amazon region, where 
the epidemic is expanding rapidly (14, 22), 
we found evidence for multiple national and 
international introductions, with 37% (n = 7/19) 
of sequences from Para and Amazonas states 
clustering in Clade 1 and 32% (n = 6/19) in 
Clade 2. 

Time-measured phylogeographic analyses 
revealed at least 102 (95% BCI: 95 to 109) in- 
ternational introductions of SARS-CoV-2 in 
Brazil (Fig. 3A and figs. S8 and S12). This 
represents an underestimate of the real num- 
ber of introductions because we sequenced, 
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Clade 2 


on average, only one out of 200 confirmed 
cases. Most of these estimated introductions 
were directed to internationally well-connected 
states (36) such as Sao Paulo (36% of all im- 
ports), Minas Gerais (24%), Ceara (10%), and 
Rio de Janeiro (8%) (fig. $12). We further 
assessed the contribution of international 
versus national virus lineage movement events 
through time (Fig. 3B). In the first phase of the 
epidemic, we found an increasing number of 
international introductions until 10 March 2020 
(Fig. 2B). Limited available travel history data 
(15) suggested that these early cases were pre- 
dominantly acquired from Italy (26%, n = 70 
of 266 unambiguously identified country of 
infection) and the United States (28%, n = 76 
of 266). After this initial phase, we found that 
the estimated number of international imports 
decreased concomitantly with the decline in the 
number of international passengers traveling 
to Brazil (Fig. 3, B and C, and S13). By con- 
trast, despite the declines in the number of 
passengers traveling on national flights (Fig. 
3C), we detected an increase in virus lineage 
movement events between Brazilian regions 
at least until early April 2020. 


Modeling spatiotemporal spread within Brazil 


To better understand virus spread across 
spatiotemporal scales within Brazil, we used a 
continuous phylogeographic model that maps 
phylogenetic nodes to their inferred origin loca- 
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1 Feb 1 Mar 1 Apr 
Date (month and day, 2020) 


1 May 


migration events were estimated from virus genomes using a phylogeographic 
approach. The first phase was dominated by virus migrations from outside of Brazil, 
whereas the second phase was marked by virus spread within Brazil. Dashed vertical 
lines correspond to the mean posterior estimate for migration events from outside 
of Brazil (blue) and within Brazil (red). (C) Locally estimated scatterplot smoothing 
of the daily number of international (blue) and national (red) air passengers in Brazil in 
2020. To, date of first reported case in Brazil (25 February 2020). 


tions (37) (Fig. 4). We distinguished branches 
that remain within a state versus those that cross 
a state to infer the proportion of within-state 
versus between-state observed virus movement. 

We estimate that during the first epidemic 
phase, SARS-CoV-2 spread mostly locally and 
within state borders. By contrast, the second 
phase was characterized by long-distance move- 
ment events and the ignition of the epidemic 
outside of the southeast region of Brazil (Fig. 
4A). Throughout the epidemic, we found that 
within-state virus lineage movement was, on 
average, 5.1-fold more frequent than between- 
state movement. Moreover, our data suggest 
that within-state virus spread and, to a lesser 
extent, between-state virus spread decreased 
after the implementation of NPIs (Fig. 4B). 
However, the more limited sampling after 
6 April 2020 (see fig. S2) decreased inferred 
virus lineage movement to the present (Figs. 
3B and 4B). 

We found that the average route length 
traveled by passenger increased by 25% during 
the second phase of the epidemic (Fig. 4C) 
despite a concomitant reduction in the num- 
ber of passengers flying within Brazil (Fig. 3C). 
The increase in the average route length after 
NPI implementation resulted from a larger 
reduction in the number of air passengers 
flying on shorter-distance journeys compared 
with those flying on longer-distance journeys. 
For example, we found an 8.8-fold reduction in 
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Fig. 4. Spread of SARS-CoV-2 in Brazil. (A) Spatiotemporal reconstruction of 
the spread of Brazilian SARS-CoV-2 clusters containing more than two 
sequences during the first (left) and the second (right) epidemic phase (Fig. 3B). 
Circles represent nodes of the maximum clade credibility phylogeny and are 
colored according to their inferred time of occurrence. Shaded areas represent 
the 80% highest posterior density interval and depict the uncertainty of the 
phylogeographic estimates for each node. Solid curved lines denote the links 
between nodes and the directionality of movement. Sequences belonging to 
clusters with fewer than three sequences were also plotted on the map with no 


Date (months and days of 2020) 


lines connecting them. Background population density for each municipality was 
obtained from the Brazilian Institute of Geography (https://www.ibge.gov.br/). 
See fig. S14 for details of virus spread in the southeast region. (B) Estimated 
number of within-state (or within a given federal unit) and between-state (or 
between federal units) virus migrations over time. Dashed lines indicate 
estimates obtained during the period of limited sampling (fig. S2). (€) Average 
distance in kilometers traveled by an air passenger per day in Brazil. The 
number of daily air passengers is shown in Fig. 3B. Light gray boxes indicate the 
starting dates of NPls across Brazil. 


the number of passengers flying in flight legs 
<1000 km, compared with a 4.4-fold reduction 
in those flying >2000 km (fig. S15). These find- 
ings emphasize the roles of within- and between- 
state mobility as a key driver of both local and 
interregional virus spread, with highly popu- 
lated and well-connected urban conurbations 
in the southeast region acting as the main sources 
of virus exports within the country (fig. S12). 


Discussion 


We provide a comprehensive analysis of SARS- 
CoV-2 spread in Brazil showing the importance 
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of community- and nation-wide measures to 
control the COVID-19 epidemic in Brazil. Al- 
though NPIs initially reduced virus transmis- 
sion and spread, the continued increase in 
the number of cases and deaths in Brazil 
highlights the urgent need to prevent future 
virus transmission by implementing rapid and 
accessible diagnostic screening, contact tracing, 
quarantining of new cases, and coordinated 
social and physical distancing measures across 
the country (38). With the recent relaxation of 
NPIs in Brazil and elsewhere, continued mo- 


lecular, immunological, and genomic surveil- 
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lance are required for real-time data-driven 
decisions. Our analysis shows how changes in 
mobility may affect global and local transmis- 
sion of SARS-CoV-2 and demonstrates how 
combining genomic and mobility data can com- 
plement traditional surveillance approaches. 
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Engineering human ACE2 to optimize binding to the 
spike protein of SARS coronavirus 2 
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The spike (S) protein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) binds 
angiotensin-converting enzyme 2 (ACE2) on host cells to initiate entry, and soluble ACE2 is a therapeutic 
candidate that neutralizes infection by acting as a decoy. By using deep mutagenesis, mutations in ACE2 
that increase S binding are found across the interaction surface, in the asparagine 90-glycosylation 
motif and at buried sites. The mutational landscape provides a blueprint for understanding the specificity 
of the interaction between ACE2 and S and for engineering high-affinity decoy receptors. Combining 
mutations gives ACE2 variants with affinities that rival those of monoclonal antibodies. A stable dimeric 
variant shows potent SARS-CoV-2 and -1 neutralization in vitro. The engineered receptor is catalytically 
active, and its close similarity with the native receptor may limit the potential for viral escape. 


n late 2019, a novel zoonotic betacorona- 

virus closely related to bat coronaviruses 

crossed into humans in the Chinese city of 

Wuhan (J, 2). The virus, called severe acute 

respiratory syndrome coronavirus 2 (SARS- 
CoV-2) because of its similarities with the SARS 
coronavirus first discovered in 2003 (3, 4), 
causes coronavirus disease 2019 (COVID-19) 
(5), which is producing devastation across the 
globe. 

The spike (S) glycoprotein of SARS-CoV-2 
binds angiotensin-converting enzyme 2 (ACE2) 
on host cells (2, 6-11). S is a trimeric class I viral 
fusion protein that is proteolytically processed 
into S1 and S2 subunits that remain non- 
covalently associated in a prefusion state 
(6, 9, 12). Upon engagement of ACE2 by a re- 
ceptor binding domain (RBD) in S1 (73), con- 
formational rearrangements occur that cause 
S1 shedding, cleavage of S2 by host proteases, 
and exposure of a fusion peptide adjacent to 
the S2' proteolysis site (12, 14-16). Folding of 
S to a postfusion conformation is coupled to 
host cell-virus membrane fusion and cyto- 
solic release of viral RNA. Atomic contacts 
with the RBD are restricted to the extra- 
cellular protease domain of ACE2 (17, 18). 
Soluble ACE2 (sACE2) in which the trans- 
membrane domain has been removed is suf- 
ficient for binding S and neutralizing infection 
(0, 19-21). A broad collection of highly potent 
neutralizing antibodies have been isolated 
(22-28), yet the virus spike shows rapid ac- 
cumulation of escape mutations when under 
selection (29). By comparison, the virus may 
have limited potential to escape sACE2- 
mediated neutralization without simultaneously 
decreasing affinity for native ACE2 receptors, 
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an outcome that is likely to attenuate virulence. 
Furthermore, sACE2 could potentially treat 
COVID-19 symptoms by proteolytic conversion 
of angiotensin peptides that regulate blood 
pressure and volume (30, 37). Recombinant 
sACE2 is safe in healthy human subjects (32) 
and patients with lung disease (33), and is 
being evaluated in a European phase 2 clinical 
trial for COVID-19 managed by Apeiron Bio- 
logics. Peptide derivatives of ACE2 are also 
being explored as cell entry inhibitors (34). 

Because human ACE2 has not evolved to 
recognize SARS-CoV-2 S, we hypothesized that 
mutations may be found that increase affinity. 
The coding sequence of full-length ACE2 with 
an N-terminal c-MYC epitope tag was diver- 
sified to create a library containing all possible 
single-amino acid substitutions at 117 sites 
that span the interface with S and the angiotensin 
peptide-binding cavity. S binding is independent 
of ACE2 catalytic activity (35) and occurs on 
the outer surface of ACE2 (17, 18), whereas 
angiotensin substrates bind within a deep cleft 
that houses the active site (36). 

The ACE2 library was transiently expressed 
in human Expi293F cells under conditions 
that typically yield no more than one coding 
variant per cell, providing a tight link between 
genotype and phenotype (37, 38). Cells were 
then incubated with a subsaturating dilution 
of medium containing the RBD of SARS-CoV-2 
fused to superfolder green fluorescent protein 
[sfGFP; (39)] (fig. SIA). Dual-color flow cytom- 
etry measurements show that amounts of 
bound RBD-sfGFP correlate with surface ex- 
pression levels of MYC-tagged ACE2. Com- 
pared with cells expressing wild-type ACE2 
(fig. SIC), many variants in the ACE2 library 
fail to bind RBD, whereas a smaller number of 
ACE2 variants showed higher binding signals 
(fig. SID). Populations of cells that express 
ACE2 variants at the cell surface with high 
(“‘nCoV-S-High”) or low (“nCoV-S-Low”) bind- 
ing to RBD were collected by fluorescence- 
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activated cell sorting (FACS) (fig. S1D). During 
FACS, the fluorescence signal for bound RBD- 
sfGFP continuously declined, requiring the 
collection gates to be regularly updated to 
“chase” the relevant populations. This is con- 
sistent with RBD dissociating during the 
experiment. 

In an approach known as deep mutagenesis 
(40), the enrichment or depletion of all 2340 
coding mutations in the library was deter- 
mined by comparing the frequencies of tran- 
scripts in the sorted populations to sequence 
frequencies in the naive plasmid library (Fig. 1A). 
Enrichment ratios and residue conservation 
scores closely agree between two independent 
FACS experiments (fig. S2). Enrichment ratios 
and conservation scores in the nCoV-S-High 
sorted cells tend to be negatively correlated 
with the nCoV-S-Low sorted cells, with the 
exception of nonsense mutations that do not 
express and were therefore depleted from both 
populations (fig. $2). Most, but not all, non- 
synonymous mutations in ACE2 did not elimi- 
nate surface expression (fig. S2). The library is 
biased toward solvent-exposed residues and 
has few substitutions of buried hydrophobic 
residues that might have greater effects on 
plasma membrane trafficking (38). 

Mapping the experimental conservation 
scores from the nCoV-S-High sorted cells to 
the structure of RBD-bound ACE2 (17) shows 
that residues buried in the interface tend to be 
conserved, whereas residues at the interface 
periphery or in the substrate-binding cleft are 
mutationally tolerant (Fig. 1, B and C). The re- 
gion of ACE2 surrounding the C-terminal end 
of the ACE2 a1 helix and B3-B4 strands has a 
weak tolerance for polar residues, whereas 
amino acids at the N-terminal end of a1 and 
the C-terminal end of o2 are preferentially 
hydrophobic (Fig. 1D), likely in part to preserve 
hydrophobic packing between ol-a2. These 
discrete patches contact the globular RBD 
fold and a long protruding loop of the RBD, 
respectively. 

Two ACE2 residues, N90 and T92 that together 
form a consensus N-glycosylation motif, are 
notable hot spots for enriched mutations (blue 
in Fig. 1A). Indeed, all substitutions of N90 
and T92, with the exception of T92S, which 
maintains the N-glycan, are highly favorable 
for RBD binding, and the N90-glycan is thus 
predicted to partially hinder the S-ACE2 
interaction. These results may depend on the 
chemical nature of glycan moieties attached in 
different cell types. 

Mining the data identifies many ACE2 muta- 
tions that are enriched for RBD binding. It has 
been proposed that natural ACE2 polymor- 
phisms are relevant to COVID-19 pathogenesis 
and transmission (41, 42), and the mutational 
landscape provided here will facilitate analy- 
ses to test this. At least a dozen ACE2 muta- 
tions at the interface enhance RBD binding, 
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and the molecular basis for affinity enhance- 
ment can be rationalized from the RBD-bound 
ACE2 cryo-electron microscopy (EM) struc- 
ture (Fig. IE) (7): Hydrophobic substitutions 
of ACE2-T27 increase hydrophobic packing 
with aromatic residues of S, ACE2-D30E ex- 
tends an acidic side chain to reach S-K417, 
and aromatic substitutions of ACE2-K31 con- 
tribute to an interfacial cluster of aromatics. 
A search for affinity-enhancing mutations in 
ACE2 using targeted mutagenesis recently 
identified D30E (43), providing independent 
confirmation of this result. 
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There are also enriched mutations in the 
second shell and farther from the interface 
that do not directly contact S but instead have 
putative structural roles. For example, proline 
substitutions were enriched at five library posi- 
tions (S19, L91, T92, T324, and Q325) where 
they might entropically stabilize the first 
turns of helices. Proline was also enriched at 
H34 where it may enforce the central bulge 
in a1, and multiple mutations were enriched 
at buried positions where they will change 
local packing (e.g., A25V, L29F, W69V, F72Y, 
and L351F). The selection of ACE2 variants 
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Tolerates polar Prefers 
amino acids SI hydrophobics 


Fig. 1. Sequence preferences 
of ACE2 residues for high 
binding to the RBD of SARS- 
CoV-2 S. (A) Logs enrichment 
ratios from the nCoV-S-High 
sorts are plotted from depleted 
or deleterious (orange) to 
enriched (dark blue). ACE2 
primary structure is shown on 
the vertical axis, amino acid 
substitutions are indicated on 
the horizontal axis. Wild-type 
amino acids are in black. Aster- 
isk (*) denotes stop codon. 

(B) Conservation scores are mapped 
to the structure (Protein Data 
Bank 6M17) of RBD (green 
ribbon)—bound protease domain 
(surface), oriented with the 
substrate-binding cavity facing 
the reader. Residues conserved 
for RBD binding are shown in 
orange; mutationally tolerant 
residues are in pale colors; 
residues that are hot spots for 
enriched mutations are in blue; 
and residues maintained as wild 
type in the ACE2 library are in 
gray. Glycans are depicted as 
dark red sticks. (©) Viewed 
looking down on to the RBD 
interaction surface. (D) Average 
hydrophobicity-weighted enrich- 
ment ratios are mapped to the 
structure, with residues tolerant 
of polar substitutions in blue and 
residues that prefer hydrophobics 
in yellow. (E) A magnified view 
of the ACE2-RBD interface 
[colored as in (B) and (C)]. Heat- 
map plots logs enrichment ratios 
from the nCoV-S-High sort. Ab- 
breviations for the amino acid 
residues are as follows: A, Ala; 
C, Cys; D, Asp; E, Glu; F, Phe; 
G, Gly; H, His; |, lle; K, Lys; 

L, Leu; M, Met; N, Asn; P, Pro; 
Q, Gln; R, Arg; S, Ser; T, Thr; 

V, Val; W, Trp; and Y, Tyr. 


for high binding signal therefore reports not 
only on affinity, but also on presentation at the 
membrane of folded structure recognized 
by SARS-CoV-2 S. Whether these mutations 
selectively stabilize a virus-recognized local 
structure in ACE2 versus the global protein 
fold is unclear. 

Thirty single-amino acid substitutions highly 
enriched in the nCoV-S-High sorted cells were 
validated by targeted mutagenesis (fig. S3). 
Binding of RBD-sfGFP to full-length ACE2 
mutants measured by dual-color flow cytom- 
etry (fig. S3) increased compared with that of 
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Fig. 2. A variant of sACE2 with high affinity for S. (A) Expi293F cells 
expressing S were incubated with purified wild-type sACE2 (gray) or sACE2.v2 
(blue) fused to 8His (solid lines) or IgGl-Fc (broken lines). Bound protein 

was detected by flow cytometry. Data are mean fluorescence units (MFU) of the 
total cell population after subtraction of background autofluorescence. n = 2 
replicates, error bars represent range. (B) Binding of 100 nM wild-type sACE2-lgG1 
(broken lines) was competed with wild type sSACE2-8h (solid gray line) or sACE2. 
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500 1000 
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v2-8h (solid blue line). The competing proteins were added simultaneously 
to cells expressing S, and relative bound protein was detected by flow 
cytometry. n = 2 replicates, error bars represent range. (C) Competition for 
binding to immobilized RBD in an ELISA between serum IgG from COVID-19 
patients versus wild-type sACE2-8h (gray) or sSACE2.v2-8h (blue). Three 
different patient sera were tested (P1 to P3 in light to dark shades). Data are 
mean + SEM, n = 2 replicates. 
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Fig. 3. A dimeric sACE2 variant with improved properties for binding viral 
spike. (A) Analytical SEC of wild-type sACE22-8h (gray) and sACE22.v2.4-8h 
(purple) after incubation at 37°C for 62 hours. (B) ELISA analysis of serum IgG 
from COVID-19 patients (P1 to P3 in light to dark shades) binding to RBD. 
Dimeric sACE22(WT)-8h (gray) or SACE22.v2.4-8h (purple) are added to 


compete with antibodies recognizing the receptor-binding site. Concentrations 
are based on monomeric subunits. Data are mean + SEM, n = 2 replicates. 
(C) RBD-8h association (t = 0 to 120 s) and dissociation (t > 120 s) with 
immobilized sACE2,(WT)-IgG1 measured by BLI. (D) BLI kinetics of RBD-8h 
binding to immobilized sACE29.v2.4-IgG1. 


the wild type, yet improvements were small 
and most apparent on cells expressing low 
amounts of ACE2. Differences in ACE2 expres- 
sion between the mutants also correlated with 
total amounts of bound RBD-sfGFP (fig. S3C), 
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demonstrating the need for caution in inter- 
preting deep mutational scan data as muta- 
tions may affect both activity and expression. 
To rapidly assess mutations in a soluble format, 
we fused the ACE2 protease domain to sfGFP. 
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Expression levels of sACE2-sfGFP were eval- 
uated qualitatively by fluorescence (fig. S4A), 
and binding to full-length S expressed at the 
plasma membrane was measured by flow cytom- 
etry (fig. S4B). A single substitution (T92Q) in 
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Fig. 4. Enhanced neutralization of SARS-CoV-1 and -2 by engineered receptors. In a microneutralization assay, monomeric (solid lines) or dimeric 
(broken lines) sACE2(WT)-8h (gray) or sACE2.v2.4-8h (purple) were preincubated with virus before adding to VeroE6 cells. Concentrations are based on 
monomeric subunits. Data are mean + SEM of n = 4 replicates. 


the N90 glycosylation motif gave a modest 
increase in binding signal, which was con- 
firmed by analysis of purified protein (fig. S5). 
Focusing on the most highly enriched sub- 
stitutions in the nCoV-S-High sorted cells that 
were also spatially segregated to minimize nega- 
tive epistasis (44), combinations of mutations 
were expressed, and these gave sACE2 large 
increases in S binding (materials and methods, 
table SI, and fig. S4B). Unexplored combinations 
of mutations may have even greater effects. 

A single variant, sACE2.v2, was chosen for 
purification and further characterization (fig. 
S6). This variant was selected because it was 
well expressed as a sfGFP fusion and it main- 
tains the N90-glycan, thus presenting a sur- 
face that more closely matches that of native 
sACE2 to minimize immunogenicity. The yield 
of sACE2.v2 was lower than that of the wild- 
type protein, and by analytical size exclusion 
chromatography (SEC), a small fraction of 
sACE2.v2 was found to aggregate after incu- 
bation at 37°C (fig. S6D). Otherwise, sACE2.v2 
was indistinguishable from the wild type by 
SEC (fig. S6C). 

In flow cytometry experiments using the puri- 
fied 8His-tagged protease domain, sACE2.v2-8h, 
but not wild type, was found to bind strongly 
to full-length S at the cell surface, suggesting 
that wild-type sACE2 has a faster off-rate that 
causes dissociation during sample washing 
(Fig. 2A and fig. $7). Differences between wild 
type and the variant were less pronounced in 
the context of an immunoglobulin G1 (IgG1)- 
Fc fusion (Fig. 2A and fig. $7), indicating that 
avidity masks gains in binding of the mutant, 
again consistent with off-rate differences be- 
tween wild type and variant sACE2. Soluble 
ACE2.v2-8h outcompetes wild-type sACE2- 
IgGl for binding to S-expressing cells, yet 
wild-type sACE2-8h does not outcompete 
sACE2-IgGI1, even at 10-fold higher concentra- 
tions (Fig. 2B). Furthermore, only engineered 
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sACE2.v2-8h effectively competed with anti- 
RBD IgG in serum from three COVID-19- 
positive patients when tested by enzyme-linked 
immunosorbent assay (ELISA) (Fig. 2C). The 
observation that up to 80% inhibition was 
achieved at saturation with sACE2.v2-8h in- 
dicates that most antibodies against RBD 
were directed at the receptor-binding site. 
Finally, biolayer interferometry (BLI) showed 
that sACE2.v2 has 65-fold higher affinity than 
the wild-type protein for immobilized RBD, 
almost entirely due to a slower off-rate (table 
82 and fig. S6, E and F). 

To address the decreased expression of 
sACE2.v2, it was hypothesized that the muta- 
tional load is too high. In second-generation 
designs, each of the four mutations in sACE2.v2 
was reverted back to the wild-type identity 
(table S1), and binding to full-length S at the 
cell surface remained high (fig. S8A). One of 
the variants (SACE2.v2.4 with mutations T27Y, 
L79T and N330Y) was purified with even higher 
yields than that of the wild type and displayed 
tight nanomolar binding to the RBD (fig. S8). 

The ACE2 construct was lengthened to in- 
clude the neck or dimerization domain, yield- 
ing a stable dimer (Fig. 3A) referred to here as 
sACE2,, which binds with high avidity to S on 
the cell surface or immobilized RBD on a 
biosensor (fig. S9). Compared to the wild type, 
dimeric sACE25.v2.4 competes more effectively 
with IgG present in serum from COVID-19 
patients (Fig. 3B). The engineered dimer may 
be useful in assessing serum or plasma (e.g., 
for convalescent plasma therapies) for con- 
centrations of the most effective SARS-CoV-2 
neutralizing antibodies (45). By immobilizing 
sACE2,-IgG1 (fig. S10) to a biosensor surface 
and incubating it with monomeric RBD-8h as 
the analyte, we determined the dissociation 
constant Kp of RBD for wild-type sACE2, to 
be 22 nM (Fig. 3C), in close agreement with 
previous reports (8, 46), whereas SACE29.v2.4 
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bound with 600 pM affinity (Fig. 3D). This 
compares favorably with results from recently 
isolated monoclonal antibodies (22-28). 

The efficacy of monomeric sACE2.v2.4 in 
neutralizing SARS-CoV-2 infection of cultured 
VeroE6 cells exceeded that of the wild-type 
protein by nearly two orders of magnitude (Fig. 
4), consistent with the biochemical binding 
data. Wild-type, dimeric sACE2z, is itself two 
orders of magnitude more potent than the 
monomeric subunit, indicating strong, avid 
interactions with spike on the virion surface, 
and dimeric sACE2,.v2.4 is yet again more 
potent with a subnanomolar median inhib- 
itory concentration (Fig. 4). Dimeric sACE2,. 
v2.4 also potently neutralizes SARS-CoV-1, de- 
spite no consideration of SARS-CoV-1 S struc- 
ture or sequence during the engineering process, 
and it is possible that the decoy receptor will 
neutralize diverse ACE2-utilizing coronaviruses 
that have yet to cross over to humans. 

To improve safety, we manufactured untagged 
sACE25.v2.4 in ExpiCHO-S cells (fig SI1A) and 
found it to be stable after incubation at 37°C 
for 6 days (fig S11B). The protein competes 
with wild-type sACE2,-IgG1 for cell-expressed 
S (fig. S11C) and binds with tight avidity to 
immobilized RBD (fig S11D). In addition to 
inhibiting virus entry, recombinant sACE2 
may have a second therapeutic mechanism: 
proteolysis of angiotensin II (a vasoconstrictive 
peptide hormone) to relieve symptoms of respi- 
ratory distress (30, 31). Soluble ACE29.v2.4 is 
found to be catalytically active, albeit with 
reduced activity (fig. S12). Whether this confers 
any therapeutic advantage or disadvantage 
over wild-type sACE2 remains to be seen. 

With astonishing speed, the scientific com- 
munity has identified multiple candidates for the 
treatment of COVID-19, especially monoclonal 
antibodies with exceptional affinity for protein 
S. Our work shows how comparable affinity 
can be engineered into the natural receptor 
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for the virus, while also providing insights 
into the molecular basis for initial virus-host 
interactions. 
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LIFE SCIENCE TECHNOLOGIES 


new products: microscopy/imaging 


Autonomous Leak-Detection Camera 
FLIR Systems announces the FLIR GF77a 
Gas Find infrared camera, its first fixed- 
mount, uncooled, autonomous leak- 
detection camera designed specifically 
to visualize methane, sulfur dioxide, 
nitrous oxide, and other industrial 
gases. The connected GF77a provides upstream and midstream gas 
processors, producers, and operators with the ability to monitor 
continuously for invisible, potentially dangerous methane leaks at 
natural gas power plants, renewable-energy production facilities, 
industrial plants, and other locations along a natural gas supply 
chain. Featuring the FLIR-patented High Sensitivity Mode (HSM), the 
technology enables better detection capabilities by accentuating 
movement to make gas plumes more visible to the user. The 
radiometrically calibrated camera also measures temperature, 
making it a solution for monitoring tank levels and inspecting 
components that may overheat. The GigE Vision- and GeniCam- 
compatible GF77a includes Wi-Fi connectivity, allowing companies 
to control and stream radiometric thermal data remotely. It’s also 
Open Network Video Interface Forum (ONVIF)-compliant and 
features environmental accessories to allow customers to tailor the 
camera to their daily needs. 

FLIR Systems 

For info: +32-(0)-3665-5100 

www.flir.com/gf77a 


Cell Imaging Multi-Mode Reader 

The Cytation 7 Cell Imaging Multi-Mode Reader from BioTek 
Instruments combines an automated upright microscope, an 
inverted microscope, and multimode microplate detection ina 
single instrument. Its inverted microscope supports fluorescence, 
brightfield, and color brightfield imaging. The magnification range 
of 1.25x to 60x allows researchers to capture and analyze large 
objects as well as intracellular details with ease. An upright reflected 
and transmitted light microscope offers up to 8x magnification 

for applications such as the enzyme-linked immunospot (ELISpot) 
assay, Slide scanning with region-of-interest detection, and colony 
counting. The multimode module in Cytation 7 features quadruple 
monochromator-based fluorescence, absorbance, and luminescence. 
Temperature and gas control combined with variable shaking provide 
an ideal environment for live-cell assays. Gen5 Software controls the 
instrument for precise, accurate image and data capture as well as 
powerful data processing for quantitative and qualitative analysis. 
Cytation 7 may be integrated with BioTek’s BioSpa 8 Automated 
Incubator, enabling full imaging or multimode workflow automation 
for up to eight microplates at once. 

BioTek Instruments 

For info: 888-451-5171 

www.biotek.com 


Ultra-High-Speed Video Camera 

The Kirana7M from Specialised Imaging enables researchers to 
capture up to 180 consecutive high-resolution images at frame 
rates of up to 7 million fps and exposure times as low as 50 ns. 

It represents the ultimate high-speed video camera, combining 
the triggering flexibility of high-speed video cameras with greater 
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resolution at megahertz capture speeds. As with all versions of the 
Kirana camera, the full 924-pixel x 768-pixel resolution is maintained 
at all frame rates, enabling it to achieve high spatial and temporal 
resolution. This compact camera includes comprehensive triggering 
facilities, highly accurate timing control, and output signals for 
synchronization with various illumination sources. All camera 
functions are controlled using an intuitive software package. Full 
remote operation using Ethernet connectivity enables the unit to be 
easily integrated into almost any environment. The Kirana7M offers 
high performance, ease of use, and operational flexibility, delivering 
impressive slow-motion video images for just about any scientific 
research application. 

Specialised Imaging 

For info: 951-296-6406 
www.specialised-imaging.com/products/video-cameras/kirana 


Indirect X-Ray and Neutron-Imaging sCMOS Camera 

Andor Technology announces the Balor-x, a large-area, 
ultrasensitive scientific complementary metal-oxide semiconductor 
(sCMOS) camera for indirect X-ray and neutron imaging. Boasting 
the unique combination of a 16.9-megapixel sensor with a rapid, 
54-fps full frame rate, ultralow noise, and high dynamic range, 
Balor-X is a versatile tool for high-energy physics applications 

such as hard X-ray/neutron-based tomographic or in situ dynamic- 
phenomena diagnostics. The camera's large field of view enables 
versatile analysis not only of larger samples but also of smaller 
samples at higher resolutions. Its exceptionally fast readout allows 
up to 1,600 fps with region-of-interest detection, considerably 
reducing acquisition time for extensive tomographic datasets and 
allowing better sampling of dynamic transient in situ phenomena 
on the hundreds-of-microns scale. When minimizing beam line time 
is paramount, not only does Balor-X acquire data rapidly but also 
gives researchers greater experimental throughput and the ability 
to couple imaging with complementary techniques such as X-ray and 
neutron scattering and diffraction. 

Andor Technology 

For info: +44-(0)-28-9023-7126 

www.andor.com/balor 


Off-Axis Parabolic Mirrors 

Optical Surfaces Ltd. produces off-axis parabolic mirrors (OAPs) for 
demanding imaging applications. OAPs combine the achromatic 
and diffraction-limited imaging properties of a parabolic mirror 
with the ability to deviate the light path off-axis, which is useful for 
most imaging systems. OAPs offer the ability to focus collimated 
light without introducing spherical aberration. Additionally, unlike 
a centered parabolic mirror, OAPs advantageously allow more 
interactive space around the focal point without disrupting the 
beam. They are especially suitable for broadband or multiple- 
wavelength applications. Using proprietary production techniques 
and benefiting from a uniquely stable manufacturing environment, 
Optical Surfaces’ experienced engineering team are world renowned 
for producing fast-focusing OAP mirrors with unmatched surface 
accuracy, surface quality, and surface slope errors. 

Optical Surfaces 

For info: +44-(0)-208-668-6126 
www.optisurf.com/index.php/products/off-axis-paraboloids 


Electronically submit your new product description or product literature information! Go to www.sciencemag.org/about/new-products-section for more information. 
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Georgia 
Tech: 


The College of Sciences at the Georgia Institute of Technology announces 
two open leadership positions. The expected rank at appointment is at 
the level of full professor. A doctorate in a relavant field is required. 
Priority will be given to candidates possessing outstanding leadership 
and scholarship. 


PHYSICS CHAIR SEARCH — SCHOOL OF PHYSICS 


The School of Physics has excellent research programs and a broad 
undergraduate and graduate teaching curriculum. Research strengths 
include astrophysics, atomic and optical physics, quantum matter and 
nanoscience, soft matter, biophysics/physics of living systems, and non- 
linear sciences. A major asset is its leadership in multi- and interdisciplinary 
scholarship along with other highly ranked programs in the sciences and 
engineering at Georgia Tech. 


Please see https://physics.gatech.edu/chair-search for the full details of 
the position, including the candidate application form. 


PSYCHOLOGY CHAIR SEARCH — SCHOOL OF PSYCHOLOGY 


The School of Psychology offers doctoral training and research 
opportunities in broad areas including cognition and brain science, 
cognitive aging, engineering psychology, industrial/organizational 
psychology, and quantitative psychology, with interdisciplinary graduate 
programs in quantitative biosciences and human computer interaction. 
The school is a partner in the Institute’s interdisciplinary program 
in neuroscience. 


Please see https://psychology.gatech.edu/chair-search for the full details 
of the position, including the candidate application form. 
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NORTON THORACIC INSTITUTE, 
Saint Joseph’s Hospital, Phoenix, Arizona 


Norton Thoracic Institute (NTI) invites applications for a faculty position 
as an Assistant or Associate Professor with expertise in cancer biology, 
immunology and or genetics. NTI has been undergoing an expansion 
in its basic research program and is recruiting an individual who will 
complement and build on research in the program focused on the study of 


lung and/or esophageal cancers. 


Applicants must have a Ph.D, MD or equivalent doctoral degree, US 
Citizenship or Permanent Resident status, with ability to lead an 
independent research program as well as potential to obtain extramural 
research support. Person to be considered for the position as an Associate 
Professor must have active extramural research funding from NCI or 


other similar sources. 


A competitive salary, start-up package, excellent fully furnished 
laboratory space, and core facilities are available. A summary of research 
interests, curriculum vitae, and names of three references should be sent 
to Billie Glasscock at billie.glasscock@dignityhealth.org. Applications 


will be reviewed until the position is filled. 


LR 
Universitat Regensburg 


The Faculty of Medicine invites applications for a 


developing professorship of 
Experimental Cardiovascular 
Pharmacology in the University 


Heart Center Regensburg 
(grade W2 with tenure-track leading to 
professorship for life at grade W3) 


for a period of six years with the legal status of temporary civil 
servant (Beamtenverhdltnis auf Zeit), to be appointed as soon 
as possible. The tenure-track professorship is supported by the 
Federal/Lander program for the promotion of young scientists 
(tenure-track program). Universitat Regensburg offers the prospect, 
upon positive tenure evaluation, of transferring to a permanent 
position as W3 grade professor with the legal status civil servant 
for life (Beamtenverhdaltnis auf Lebenszeit). The requirements for 
the tenure evaluation can be found at https://go.uni-regensburg.de/ 
tt-satzung. 


We seek candidates to professionally represent the field of Phar- 
macology in research and teaching. Initially, the main focus of the 
professorship will refer to research activities. The successful candi- 
date is expected to strengthen the field of cardiovascular research 
at the faculty as well as at the university. Teaching relates to medi- 
cine and dentistry and is welcome for other degree programmes. 


Prerequisites for taking up the position are, alongside the general 
conditions from public sector employment law, a completed uni- 
versity degree; pedagogic suitability; a very good doctoral thesis 
grade; outstanding, preferably international, research experi- 
ence; and further academic performance appropriate to the early 
career stage (for example exceptional publications in internatio- 
nally peer-reviewed journals) and a two-year postdoc period. Ap- 
plicants with an MD degree desirably should have a specialization 
in Pharmacology and Toxicology and/or Clinical Pharmacology. 


Insofar as the candidate undertook employment as a research as- 
sociate or research assistant after their doctorate, the duration of 
the doctoral studies and the employment should not sum to more 
than six years/or in the case of medicine not more than nine years. 


Applicant must have changed universities following the doctorate 
or have been academically active for at least two years somewhere 
other than Universitat Regensburg before the application. 


Universitat Regensburg is particularly committed to reconciling fa- 
mily and working life (for more information, see https:/Awww.uni- 
regensburg.de/chancengleichheit). To fulfill the equality directive 
and increase the number of female professors, we explicitly en- 
courage applications from qualified women. 


In case of essentially similar suitability, applicants with severe 
disabilities will be preferentially selected. 


The prerequisites for appointment under civil servant law are based 
on the provisions of the Bavarian Public Service Code (BayBG) and 
the Bavarian University Staff Act (BayHSchPG). 


Applications containing the normal documents (resume, certificates, 
list of publications with copies of the most important papers) 
should be submitted, preferentially electronically (berufungen. 
medizin@ur.de) 


by 31.10.2020 


to the Dean of the Faculty of Medicine of Universitat Regensburg, 
Prof. Dr. Dirk Hellwig, Franz-Josef-StrauB-Allee 11, D-93053 
Regensburg. 


Information on data protection can be found at: 
https://www.uni-regensburg.de/datenschutz/ 


This is the English translation of a German job advertisement 
published by the Universitat Regensburg at https://go.uni- 
regensburg.de/stellen-professuren. Only the original German 
text is legally binding. 
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Technology 


@ Mi Massachusetts 
i I Institute of 
Ez 


Department of Earth, Atmospheric and 
Planetary Sciences (EAPS) 


Tenure-track Faculty Position 


The Department of Earth, Atmospheric and Planetary Sciences (EAPS) at the Massachusetts Institute 

of Technology (MIT) Cambridge, Massachusetts invites applications for a tenure-track faculty position in 
the broad area of Planetary Science. EAPS is an academic community of approximately 40 faculty, 

100 research staff (including postdocs), and 180 students, who together form leading research programs 
on all aspects of Earth, planetary, geo-biological, ocean, atmospheric, and climate sciences, some of 
which reside within the MIT WHOI Joint Program. 


online @sciencecareers.org 


EAPS is committed to academic excellence and to fostering a diverse, equitable, and inclusive 
environment. We seek an outstanding scientist who has the potential for innovation and leadership in 
research, commits to teaching and mentoring undergraduate and graduate students, and shares the 
Principles of our Community. 
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A complete application includes a cover letter, curriculum vitae, a 1- to 2-page statement on research 
and one on teaching and mentoring, and three letters of recommendation. Recognizing that educational 
experiences of all students are enhanced when the diversity of their backgrounds is acknowledged and 
valued, we ask candidates to articulate (in the teaching and mentoring statement, and, as appropriate, 
in the cover letter or research statement) their views on inclusivity and equity as they pertain to teaching, 
mentorship, research, and service. 


Applicants must hold a Ph.D. in planetary science, astronomy/astrophysics, or other related field by the 
start of employment. Our intent is to hire at the assistant professor level, but more senior appointments 
may also be considered. Applications are being accepted at Academic Jobs Online: 
https://academicjobsonline.org/ajo/jobs/16642. 


To receive full consideration, complete applications must be received by November 1, 2020. 


Search Contact: Ms. Karen Fosher, HR Administrator, EAPS, 54-924, Massachusetts Institute of 
Technology, 77 Massachusetts Avenue, Cambridge, MA 02139-4307, email: kfosher@mit.edu. 


MIT is an Equal Opportunity/ Affirmative Action employer _ 
http://web.mit.edu 
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WORKING LIFE 


By René S. Shahmohamadloo 


1270 


Mentoring with trust 


hurried downstairs to the cafeteria. At the table sat my new mentees: six eager undergraduates 
who had signed on to work on an 8-month aquatic toxicology project I had devised. It was a crucial 
piece of my Ph.D. research, and it would satisfy a key graduation requirement for the undergrads. 
“Starting today, I get to learn what it’s like to be my Ph.D. adviser,’ I thought to myself excitedly. 
But a few minutes into the meeting, the students broke the news: They didn’t have any training in 
toxicology. My chest tightened. How would this ever work? 


My inspiration to engage under- 
graduates in my research had come 
after 2 years of working as a teach- 
ing assistant. Many of my under- 
graduate students had voiced the 
same frustrations I once had: They 
were expected to absorb facts and 
regurgitate them in exams, rinse 
and repeat, without any real criti- 
cal thinking or opportunity to ap- 
ply what they had learned. I could 
fill that gap, I believed, by creating 
a project related to my own work 
and enlisting undergrads as the 
researchers, guiding them through 
the process while empowering 
them to take the lead. 

My thesis adviser was supportive, 
knowing it would be good experi- 
ence for a principal investigator (PI) 
hopeful like me. My department 
purchased the fish we would study, 
and a government research lab of- 
fered space for the experiments. Everything was in place— 
except for the students’ toxicology training. 

I was worried. But 150 yearling rainbow trout were wait- 
ing to be picked up from the hatchery. Backing out was not 
an option. 

I reminded myself how green I had been when I was an 
undergrad just starting to work with a Ph.D. student. My 
first day in the lab, I was tasked with exposing plants to 
precise doses of chemicals and measuring their responses— 
experiments unlike any I’d done before. Despite my lack of 
experience, my mentor gave me a key to the plant growth 
chambers and walked me through how to set up and 
run the experiment. Then, he left me to it. He assured me 
that he was available to help, but he did not hover over 
my shoulder. 

I spent hours meticulously setting up the experiment— 
and realized 3 hours later, after checking my lab notebook, 
that I had dosed the plants with the wrong concentrations 
of chemicals. I had to throw everything out and start over. 


“Letting the students find 
their own way gave them room 
to grow as scientists.” 


But my mentor was patient. He let 
me make these mistakes so I could 
learn from them and find my own 
way as a researcher. 

Now his example inspired me. 
On the students’ first day in the 
lab, I walked them through the 
facilities and trained them on 
the protocols they would be us- 
ing. Then, I let them be and stood 
by, ready to help. In the first few 
days, I noticed that some forgot to 
calibrate the instruments or didn’t 
follow my instructions for dis- 
secting the fish. My instinct was 
to jump in and save the day. But 
instead, I refrained from interven- 
ing and watched proudly as the 
students identified their mistakes 
and learned from them. 

Later, I put them in the driver’s 
seat when writing up the results 
for publication. The students sur- 
prised me by taking the paper in a different direction than 
we had discussed. Again, I trusted them, and they pre- 
pared an excellent manuscript. 

When we reconvened in the cafeteria for a reflection 
meeting 6 months into the project, the students thanked 
me for not micromanaging them, even though it had been 
scary for them at first. Letting the students find their own 
way gave them room to grow as scientists. And in the pro- 
cess, I also grew as a mentor. 

Good mentorship means trusting your mentees’ capac- 
ity and treating them as more than instruments to collect 
data. I hope that someday I’m able to put this approach to 
use as a PI running my own lab. But it can be employed at 
any level. Good mentorship is good mentorship, whether 
you're a grad student or a PI—and, when given the chance, 
mentees can handle the responsibility. ™ 


VA 


René S. Shahmohamadloo is a Ph.D. candidate at the University of 
Guelph in Canada. Send your career story to SciCareerEditor@aaas.org. 
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